40 links
tagged with postgres
Links
Instacart has developed a modern search infrastructure on Postgres to enhance their search capabilities by integrating traditional full-text search with embedding-based retrieval. This hybrid approach addresses challenges such as overfetching, precision and recall control, and operational burdens, resulting in improved relevance, performance, and scalability for their extensive catalog of grocery items.
The article discusses enhancements and changes introduced in PostgreSQL 18, specifically focusing on the RETURNING clause. It highlights new features that improve functionality and performance, making it easier for developers to retrieve data after insert, update, or delete operations. The author also compares these enhancements with previous versions, showcasing the evolution of PostgreSQL capabilities.
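The headline change can be sketched in SQL: PostgreSQL 18 lets RETURNING reference both the old and new versions of a modified row. The table and values here are illustrative, not taken from the article:

```sql
-- PostgreSQL 18: RETURNING can address both row versions
UPDATE products
   SET price = price * 1.10
 WHERE id = 42
RETURNING old.price AS old_price, new.price AS new_price;
```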
The blog post discusses a postmortem analysis of a significant corruption issue experienced with the PostgreSQL database system, detailing the causes, impacts, and lessons learned from the incident. It emphasizes the importance of robust data integrity measures and the need for improved monitoring and response strategies in database management systems.
CocoIndex is a framework designed to streamline incremental data flows that integrate both structured and unstructured data sources, particularly using PostgreSQL. It allows for uniform handling of data operations, including AI transformations like generating embeddings, while ensuring operational simplicity and data consistency. The example provided demonstrates how to read, transform, and store product data efficiently for semantic search capabilities.
The article explores the original goals of the Postgres project and highlights how its creators successfully achieved these objectives. It discusses the foundational principles that guided the development of Postgres and its evolution into a robust database system known for its reliability and advanced features.
The article explores unique features of PostgreSQL grammar, focusing on custom operators, precedence in compound selects, and various syntax nuances such as string continuation, quoted identifiers, and Unicode escapes. It highlights how these aspects can enhance functionality while also presenting challenges for implementation.
To efficiently insert large datasets into a Postgres database, combining Spark's parallel processing with Postgres's COPY command, driven from Python, can significantly enhance performance. By repartitioning the data and utilizing multiple writers, the author was able to insert 22 million records in under 14 minutes, leveraging Postgres's bulk-loading capabilities over traditional JDBC methods.
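The bulk-loading path can be sketched in Python: a hypothetical helper serializes rows into the tab-separated stream COPY expects, with the psycopg2 call that would feed it to Postgres left as a comment, since it needs a live connection:

```python
import io

def rows_to_copy_buffer(rows):
    """Serialize rows into the tab-separated text format Postgres COPY expects."""
    buf = io.StringIO()
    for row in rows:
        buf.write("\t".join(str(col) for col in row) + "\n")
    buf.seek(0)
    return buf

buf = rows_to_copy_buffer([(1, "apples", 2.99), (2, "bread", 3.49)])
# With a live connection, this buffer would be bulk-loaded via psycopg2:
#   cur.copy_expert("COPY products (id, name, price) FROM STDIN", buf)
print(buf.getvalue())
```

Each Spark partition can run a writer like this in parallel, which is where the speedup over row-at-a-time JDBC inserts comes from.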
Managing replication slots in Postgres is crucial to prevent WAL bloat and ensure efficient Change Data Capture (CDC) processes. Best practices include using the pgoutput plug-in, defining maximum replication slot sizes, enabling heartbeats for idle databases, and utilizing table-level publications and filters to optimize resource usage. These strategies help maintain database performance and avoid operational issues.
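The slot-size advice translates into a short postgresql.conf fragment; the 10GB cap is an illustrative value, not a recommendation from the article:

```ini
# postgresql.conf: bound how much WAL an inactive slot may retain
wal_level = logical              # required for logical replication slots
max_slot_wal_keep_size = 10GB    # invalidate slots that fall further behind (PG 13+)
```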
Build and deploy AI agent workflows quickly using Sim, available as a cloud-hosted service or self-hosted with Docker and PostgreSQL with the pgvector extension. The article details the installation process, including commands for setting up the application and running it with local AI models, and covers the configuration needed for development environments.
PostgreSQL has been integrated as a DIY state backend option for Pulumi, providing teams with a reliable alternative to traditional object storage solutions. This community-driven contribution enhances state management by offering features like ACID compliance, large object support, and improved performance for smaller state files, while still highlighting the benefits of using Pulumi Cloud for enterprise needs. Future enhancements for the PostgreSQL backend include high availability and multi-tenant support.
The article discusses the implementation of direct TLS (Transport Layer Security) connections for PostgreSQL databases, emphasizing the importance of secure data transmission. It outlines the necessary configurations and steps to enable TLS, enhancing the security posture of database communications. Best practices for managing certificates and connections are also highlighted to ensure a robust security framework.
The article discusses the advantages of using Redis for caching in applications, particularly in conjunction with Postgres for data storage. It highlights Redis's speed and efficiency in handling cache operations, which can significantly improve application performance. Additionally, it addresses potential pitfalls and best practices for integrating Redis with existing systems.
The article discusses various methods to intentionally slow down PostgreSQL databases for testing purposes. It explores different configurations and practices to simulate performance degradation, aiding developers in understanding how their applications behave under stress. This approach helps in identifying potential bottlenecks and preparing for real-world scenarios.
The article introduces Multigres, an adaptation of Vitess for PostgreSQL, highlighting its goal of bringing Vitess-style sharding and scalability to Postgres so that large-scale workloads can be managed more efficiently.
Databricks has introduced Lakebase, a fully managed PostgreSQL OLTP engine integrated into its platform, aimed at bridging the gap between OLTP and OLAP systems. This development offers features like Postgres compatibility, unified security, and elastic storage, potentially streamlining operations for teams already using Databricks. However, its impact may vary for organizations not heavily invested in the Databricks ecosystem.
The article introduces pg_textsearch, a new PostgreSQL extension that brings true BM25 ranking to hybrid retrieval. This addition aims to improve search relevance and efficiency within the database, making it a valuable tool for developers and data analysts.
This tutorial guides users through setting up a complete Change Data Capture (CDC) pipeline using Debezium and Kafka Connect to stream changes from a PostgreSQL database. It covers the prerequisites, infrastructure setup with Docker, PostgreSQL configuration, connector registration, and observing change events in Kafka topics.
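Connector registration in such a pipeline is a JSON document POSTed to Kafka Connect; a minimal sketch with placeholder hostnames and credentials:

```json
{
  "name": "inventory-connector",
  "config": {
    "connector.class": "io.debezium.connector.postgresql.PostgresConnector",
    "database.hostname": "postgres",
    "database.port": "5432",
    "database.user": "postgres",
    "database.password": "postgres",
    "database.dbname": "inventory",
    "topic.prefix": "inventory",
    "plugin.name": "pgoutput"
  }
}
```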
The article discusses the advantages of indexing JSONB data types in PostgreSQL, emphasizing improved query performance and efficient data retrieval. It provides practical examples and techniques for creating indexes, as well as considerations for maintaining performance in applications that utilize JSONB fields.
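The standard technique is a GIN index over the JSONB column, which serves containment queries; the table and column names below are illustrative:

```sql
-- jsonb_path_ops yields a smaller index that supports @> containment
CREATE INDEX idx_products_attrs ON products USING GIN (attrs jsonb_path_ops);

-- Served by the index via the containment operator
SELECT id FROM products WHERE attrs @> '{"color": "red"}';
```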
The article discusses techniques for enhancing query performance in PostgreSQL by manipulating its statistics tables. It explains how to use these statistics effectively to optimize query planning and execution, ultimately leading to faster data retrieval. Practical examples and insights into the PostgreSQL system are provided to illustrate these methods.
The podcast episode features Aaron Katz and Sai Krishna Srirampur discussing the transition from Postgres to ClickHouse, highlighting how this shift simplifies the modern data stack. They explore the benefits of ClickHouse's architecture for analytics and performance in data-driven environments.
PostgreSQL 18 introduces significant enhancements, including a new asynchronous I/O subsystem for improved performance, native support for UUIDv7 for better indexing, and improved output for the EXPLAIN command. Additionally, it streamlines major version upgrades and offers new capabilities for handling NOT NULL constraints and RETURNING statements.
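UUIDv7 helps indexing because its leading 48 bits are a millisecond timestamp, so freshly generated keys sort in creation order instead of scattering writes across the index. A stdlib Python sketch of the RFC 9562 layout (not PostgreSQL's own implementation) makes the point:

```python
import os
import uuid

def uuid7(unix_ts_ms: int) -> uuid.UUID:
    """Build a UUIDv7: 48-bit ms timestamp, then version/variant bits over random data."""
    ts = unix_ts_ms.to_bytes(6, "big")     # 48-bit big-endian timestamp
    rand = bytearray(os.urandom(10))
    rand[0] = (rand[0] & 0x0F) | 0x70      # set version 7
    rand[2] = (rand[2] & 0x3F) | 0x80      # set RFC 4122 variant
    return uuid.UUID(bytes=ts + bytes(rand))

earlier, later = uuid7(1_700_000_000_000), uuid7(1_700_000_000_001)
print(earlier.version, earlier.bytes < later.bytes)
```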
PLJS is a lightweight and efficient JavaScript Language Extension for PostgreSQL, compatible with version 14 and above. Users can easily install and use it by executing specific SQL commands, and they can find support through a dedicated Discord channel. The extension includes various features such as type conversion, configuration options, and a roadmap for future development.
The implementation of Log-Structured Merge (LSM) trees in Postgres aimed to enhance write throughput for real-time applications but resulted in issues with physical replication. The article explores the challenges of ensuring replication safety and the role of hot_standby_feedback in mitigating logical consistency problems during high-volume write operations.
Embracing a flexible approach to data storage, the article advocates for using PostgreSQL to store various types of data without overthinking their structure. It highlights the advantages of saving raw data in a database, allowing for easier modifications and queries over time, illustrated through examples like Java IDE indexing, Chinese character storage, and sensor data logging.
The article discusses the exciting new features and improvements introduced in PostgreSQL 18, highlighting enhancements in performance, security, and usability. It emphasizes how these updates position PostgreSQL as a leading database solution for developers and businesses alike. Additionally, the blog encourages readers to explore the potential of PostgreSQL in their projects and applications.
Hatchet is a background task management platform that simplifies the process of running and distributing functions across workers using Postgres. It offers features such as task orchestration, flow control, scheduling, and a real-time dashboard, making it easier to manage complex workflows and ensure task durability without the complications of traditional task queues.
Double-entry ledger modeling is underutilized in modern software development, despite its potential to simplify tracking financial transactions and other amount changes. By implementing a ledger system, developers can create a more robust and auditable way to manage various accounts, payments, and even user points, reducing complexity in their applications. Using a ledger can streamline data handling and improve error-checking across different use cases.
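The core invariant is that every transaction's legs sum to zero, so amounts are never created or destroyed, only moved between accounts. A minimal Python sketch with hypothetical account names:

```python
from collections import defaultdict

class Ledger:
    """Double-entry ledger: every posting's legs must sum to zero."""
    def __init__(self):
        self.balances = defaultdict(int)   # amounts in cents

    def post(self, legs):
        # legs: list of (account, signed_amount) pairs
        if sum(amount for _, amount in legs) != 0:
            raise ValueError("unbalanced transaction")
        for account, amount in legs:
            self.balances[account] += amount

ledger = Ledger()
# A $25.00 user payment: debit cash, credit the user's payable account
ledger.post([("cash", 2500), ("alice:payable", -2500)])
print(dict(ledger.balances))
```

Because the zero-sum check runs on every posting, the total of all balances is always zero, which is what makes the system auditable.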
PG Linter is a PostgreSQL extension designed to analyze databases for potential issues, performance problems, and best practice violations, targeting developers and operations teams without deep DBA knowledge. It features a rule-based approach for customizable checks across various categories, including performance analysis, schema validation, and security auditing, and integrates seamlessly into development workflows.
Motion transitioned from CockroachDB to Postgres due to escalating costs and operational challenges, particularly with migrations and ETL processes. The migration revealed better performance with Postgres for many queries, despite some initial advantages of Cockroach’s query planner. The move ultimately streamlined operations and resolved numerous UI and support issues experienced with CockroachDB.
Recall.ai faced significant performance issues with their Postgres database due to the high concurrency of NOTIFY commands used during transactions, which caused global locks and serialized commits, leading to downtime. After investigating, they discovered that the LISTEN/NOTIFY feature did not scale well under their workload of tens of thousands of simultaneous writers. They advise against using LISTEN/NOTIFY in high-write scenarios to maintain database performance and scalability.
Postgres logical replication can struggle with TOAST columns, leading to incomplete change events in Debezium when values remain unchanged. This article examines Debezium's reselect post processor as a solution, alongside more comprehensive approaches using Apache Flink for stateful stream processing to manage TOAST column values effectively.
Postgres is favored over MySQL for its support of transactional DDL: a batch of schema changes can be rolled back as a unit if any of them fails, which is essential for safe database migrations. The author highlights the complications that arise when down scripts are not properly tested, emphasizing Postgres's advantage in preserving schema integrity during migrations, and argues this makes Postgres the better choice for developers working with frameworks like the Play Framework.
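Transactional DDL means schema changes participate in transactions like ordinary statements; a minimal sketch with an illustrative table:

```sql
BEGIN;
ALTER TABLE users ADD COLUMN last_login timestamptz;
ALTER TABLE users ADD COLUMN plan_id bigint;  -- suppose a later step fails
ROLLBACK;  -- both ALTERs are undone; the schema is exactly as before
```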
A reproducible stack for ingesting Notion databases into a Postgres warehouse is outlined, utilizing Dagster for orchestration and monitoring. The setup requires Docker Compose, and detailed steps for integration with Notion, environment configuration, and service deployment are provided. Additionally, users can manage pipeline states and backups while enabling scheduled runs and real-time logging.
The article examines the PostgreSQL wire protocol, describing how it structures communication between clients and the server. It walks through the protocol's message format and features, giving developers and database administrators a clearer picture of what happens on the wire.
Postgres replication slots utilize two log sequence numbers (LSNs) — confirmed_flush_lsn and restart_lsn — to manage data streaming and retention effectively. The confirmed_flush_lsn indicates the last acknowledged data by the consumer, while the restart_lsn serves as a retention boundary for WAL segments needed for ongoing transactions. Understanding these differences is essential for troubleshooting replication issues and optimizing WAL retention in production environments.
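An LSN is printed as two hexadecimal halves encoding a single 64-bit WAL byte position, so the WAL a slot pins is just the difference between two positions. A Python sketch with illustrative sample values:

```python
def parse_lsn(lsn: str) -> int:
    """Convert a Postgres LSN like '16/B374D848' to a 64-bit byte position."""
    high, low = lsn.split("/")
    return (int(high, 16) << 32) | int(low, 16)

restart_lsn = parse_lsn("16/B3740000")
confirmed_flush_lsn = parse_lsn("16/B374D848")
# Bytes of WAL the server must retain between the slot's two positions
print(confirmed_flush_lsn - restart_lsn)
```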
Local development can be streamlined with Fiber v3's new Services API and Testcontainers, allowing developers to manage external service dependencies like PostgreSQL containers seamlessly within their applications. This integration promotes a consistent development environment, enhances testing capabilities, and simplifies the lifecycle management of backing services. The article provides a guide on building a Fiber app with these new features for improved local development experiences.
Unique indexes in PostgreSQL have a limitation where entries larger than 1/3 of a buffer page (~2.7KB) cannot be indexed, particularly affecting large text fields. To enforce uniqueness on large data, a common workaround is to create a hash of the data and index that instead, allowing for efficient comparisons without exceeding size constraints. The article explains the reasons behind these constraints and offers a practical solution using hash functions.
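The workaround indexes a fixed-size digest of the text instead of the text itself; on the SQL side the unique index would cover a digest expression, for example via pgcrypto's digest function. A Python sketch of why this sidesteps the limit:

```python
import hashlib

def content_key(text: str) -> bytes:
    """Fixed 32-byte key for uniqueness checks, regardless of input size."""
    return hashlib.sha256(text.encode("utf-8")).digest()

small = content_key("hello")
large = content_key("x" * 1_000_000)   # far beyond the ~2.7KB btree entry limit
print(len(small), len(large))          # both 32 bytes, so always indexable
```

Equal inputs always produce equal digests, so a unique index on the digest enforces uniqueness of the underlying text (up to the negligible chance of a hash collision).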
The article explores a mysterious issue related to PostgreSQL's handling of SIGTERM signals, which can lead to unexpected behavior during shutdown. It discusses the implications of this behavior on database performance and reliability, particularly in the context of modern cloud architectures. The author highlights the importance of understanding these nuances to avoid potential pitfalls in database management.
The article discusses the evolving strategies for scaling PostgreSQL databases, emphasizing the importance of understanding Postgres internals, effective data modeling, and the appropriate use of indexing. It also covers hardware considerations, configuration tuning, partitioning, and the potential benefits of managed database services, while warning against common pitfalls like over-optimization and neglected maintenance practices.
The article discusses how to archive PostgreSQL partitions to Apache Iceberg, highlighting the benefits of using Iceberg for managing large datasets and improving query performance. It outlines the steps necessary for implementing this archiving process and emphasizes the efficiency gained through Iceberg's table format.