Links
The author shares their shift from using Excel and Google Sheets to DuckDB for handling CSV files. They highlight the simplicity of using SQL for tasks like extracting unique user IDs and exporting data, while also noting the convenience of directly querying various data sources.
As of version 1.4.2, the DuckDB-Iceberg extension supports insert, update, and delete operations on Iceberg v2 tables. Users can connect to Iceberg REST Catalogs, manage table properties, and modify data with standard SQL syntax. Limitations remain: updates on partitioned tables are restricted, and copy-on-write is not supported.
This article explains how to deploy DuckDB as a WebAssembly module within Cloudflare Workers, enabling SQL queries without a traditional database server. It details the limitations of Cloudflare Workers, the use of Emscripten's Asyncify to handle asynchronous calls, and provides setup and coding instructions for creating a SQL query API.
This article critiques SQL's complexities and inefficiencies while highlighting alternatives like DuckDB. It discusses common frustrations with SQL syntax and suggests ways to improve usability, including more intuitive commands and better error handling.
pg_lake allows Postgres to manage Iceberg tables and interact with data stored in object storage like S3. It supports transactions, various data formats, and utilizes DuckDB for efficient query execution. Users can create, modify, and query data seamlessly within Postgres.
This article explains the optimization rules in DuckDB, focusing on how its advanced optimizer enhances query performance. It details the optimizer's structure, core functions, and how to implement custom optimization rules. A brief overview of 26 built-in optimization rules is also provided.
The author shares their shift from using Excel and Google Sheets to DuckDB and SQL for handling CSV files, highlighting the efficiency of querying data directly. They discuss the benefits of using SQL for data manipulation and invite readers to share their own CSV handling tips.
The article compares DuckDB and Polars and argues that the choice between them depends on the context and requirements of the task. It presents DuckDB as an analytical database built around SQL queries and Polars as a fast data manipulation library for data processing, akin to Pandas. The author concludes that neither is definitively "better" and that the problem being solved should drive the choice.
Sirius is a GPU-native SQL engine that integrates with existing databases like DuckDB using the Substrait query format, achieving approximately 10x speedup over CPU query engines for TPC-H workloads. It is designed for interactive analytics and supports various AWS EC2 instances, with detailed setup instructions for installation and performance testing. Sirius is currently in active development, with plans for additional features and support for more database systems.
The stochastic extension for DuckDB enhances SQL capabilities by adding a range of statistical distribution functions for advanced statistical analysis, probability calculations, and random sampling. Users can install the extension to compute various statistical properties, generate random samples, and perform complex analyses directly within their SQL queries. The extension supports numerous continuous and discrete distributions, making it a valuable tool for data scientists and statisticians.
DuckDB GSheets is an experimental extension that allows users to read and write Google Sheets using SQL commands. It supports authentication through various methods, including access tokens and private keys, enabling seamless integration between DuckDB and Google Sheets. The extension is community-maintained and comes with specific usage guidelines and limitations.
The Tera extension for DuckDB enables powerful template rendering directly within SQL queries, facilitating the generation of dynamic reports, configuration files, HTML, and more. It utilizes the Tera templating engine to allow users to create personalized content and perform data transformations seamlessly from their database environment.
The article explores a creative use of DuckDB's WebAssembly (WASM) capabilities to render the classic video game Doom using SQL queries. It showcases how SQL, typically used for data manipulation, can be leveraged in unconventional ways to create interactive experiences like gaming. The approach highlights the flexibility and power of modern database technologies in innovative applications.