Click any tag below to further narrow down your results
Links
This article details how OpenAI scaled PostgreSQL to handle the massive traffic from 800 million ChatGPT users. It discusses the challenges faced during high write loads, optimizations made to reduce strain on the primary database, and strategies for maintaining performance under heavy demand.
This article discusses how Recall.ai faced delays in PostgreSQL connections during high-load meeting spikes. The issue stemmed from the single-threaded nature of the postmaster process, which struggled to handle the surge in connection requests, leading to significant latency.
Pinecone's Dedicated Read Nodes (DRN) offer exclusive infrastructure for high-demand applications, providing predictable performance and cost. They allow for dedicated capacity without the interference of other workloads, making them suitable for tasks like semantic search and real-time recommendation systems. Users can scale their workloads easily by adjusting replicas and shards.
Patreon faced challenges in scaling its infrastructure for live events, necessitating cross-team collaboration to quantify capacity and optimize performance. Through careful analysis and prioritization of app requests, they focused on reducing load and enhancing user experience while maintaining system reliability. Key learnings emphasized the importance of optimizing both client and server aspects to achieve scalability.
The article discusses the importance of focusing on qualitative metrics rather than purely quantitative ones for scaling businesses. It emphasizes that traditional metrics may not accurately reflect a company's growth potential and encourages a deeper understanding of what drives success. The author argues for a holistic approach to evaluating performance and making strategic decisions.
Recall.ai faced significant performance issues with their Postgres database due to the high concurrency of NOTIFY commands used during transactions, which caused global locks and serialized commits, leading to downtime. After investigating, they discovered that the LISTEN/NOTIFY feature did not scale well under their workload of tens of thousands of simultaneous writers. They advise against using LISTEN/NOTIFY in high-write scenarios to maintain database performance and scalability.