Click any tag below to further narrow down your results
Links
The author shares their shift from using Excel and Google Sheets to DuckDB for handling CSV files. They highlight the simplicity of using SQL for tasks like extracting unique user IDs and exporting data, while also noting the convenience of directly querying various data sources.
Hannah, a Customer Engineer at MotherDuck, developed a personalized performance summary for her team using SQL. The project compiled metrics like query counts and database creations, assigning playful "duck personas" based on performance. The article outlines the technical steps taken to filter data and generate the final report.
Google introduced BigQuery-managed AI functions that integrate generative AI directly into SQL queries. These functions—AI.IF, AI.CLASSIFY, and AI.SCORE—enable tasks like semantic filtering, data classification, and ranking without complex prompt tuning. This aims to simplify access to AI-driven insights for data practitioners.
This article explains the new support for SQL aggregations in Cloudflare's R2 SQL, which allows users to summarize large datasets effectively. It covers how to use aggregation queries, the importance of pre-aggregates, and introduces the concepts of scatter-gather and shuffling for efficient data processing.
The author shares their shift from using Excel and Google Sheets to DuckDB and SQL for handling CSV files, highlighting the efficiency of querying data directly. They discuss the benefits of using SQL for data manipulation and invite readers to share their own CSV handling tips.
The article discusses the concept of temporal joins, which allow for querying time-based data across different tables in a database. It covers the importance of temporal data in applications and provides examples of how to implement temporal joins effectively. Additionally, it highlights the benefits of using these joins for better data analysis and insights.