Quit Emailing Yourself

# python → sql

3 links tagged with all of: python + sql

Click any tag below to further narrow down your results

Links

Spark Declarative Pipelines Programming Guide

This article explains Spark Declarative Pipelines (SDP), a framework for creating data pipelines in Spark. It covers key concepts like flows, datasets, and pipelines, along with how to implement them in Python and SQL. The guide also includes installation instructions and usage of the command line interface.

Saved by tldr-importer · Last saved February 14, 2026 · 5 min read

+ spark + pipelines + data-processing sql ✓ python ✓

GitHub - bruin-data/bruin: Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

Bruin is a data pipeline tool that integrates data ingestion, transformation, and quality checks into one framework. It supports SQL, Python, and R while working across major data platforms, whether on a local machine or cloud services like EC2. The tool offers built-in features like Jinja templating and data validation for streamlined workflows.

Saved by tldr-importer · Last saved February 14, 2026 · 1 min read

+ data-pipeline + ingestion + transformation sql ✓ python ✓

GitHub - Eventual-Inc/Daft: Distributed query engine providing simple and reliable data processing for any modality and scale

Daft is a distributed query engine designed for large-scale data processing using Python or SQL, built with Rust. It offers a familiar interactive API, powerful query optimization, and seamless integration with data catalogs and multimodal types, making it suitable for complex data operations in cloud environments. Daft supports interactive and distributed computing, allowing users to efficiently handle diverse data types and perform operations across large clusters.

Saved by tldr-importer · Last saved October 29, 2025 · 3 min read

+ data-processing + distributed-computing python ✓ sql ✓ + multimodal