Quit Emailing Yourself

# architecture → caching → performance

4 links tagged with all of: architecture + caching + performance

Click any tag below to further narrow down your results

Links

High-Throughput Graph Abstraction at Netflix: Part I | by Netflix Technology Blog | Feb, 2026 | Medium

This article explains Netflix's Graph Abstraction, which is designed to handle high-throughput operational workloads, achieving nearly 10 million operations per second. It details the architecture, data storage strategies, and caching mechanisms that support real-time graph use cases such as social connections and service topology.

Saved by tldr-importer · Last saved February 14, 2026 · 8 min read

+ graph-abstraction + data-storage caching ✓ performance ✓ architecture ✓

What I learned building a vector database on object storage

The article details the author's journey to create a vector database inspired by Turbopuffer's architecture, using Amazon S3 for storage. It covers design challenges, trade-offs, and incremental improvements made during development, focusing on performance and cost-efficiency.

Saved by tldr-importer · Last saved February 14, 2026 · 6 min read

+ vector-database + s3 architecture ✓ performance ✓ caching ✓

Colocating Input Partitions with Kafka Streams When Consuming Multiple Topics: Sub-Topology Matters! | by Vishal Sharma | Expedia Group Technology | Medium

This article discusses how Expedia Group improved their Kafka Streams application by ensuring that identical keys from two topics were processed by the same instance. They faced issues with partition assignment and solved it by using a shared state store, which enhanced caching efficiency and reduced redundant API calls.

Saved by tldr-importer · Last saved February 14, 2026 · 3 min read

+ kafka + kafka-streams caching ✓ architecture ✓ performance ✓

Optimizing LLM Performance with LM Cache: Architectures, Strategies, and Real-World Applications | HackerNoon

The article discusses optimizing large language model (LLM) performance using LM cache architectures, highlighting various strategies and real-world applications. It emphasizes the importance of efficient caching mechanisms to enhance model responsiveness and reduce latency in AI systems. The author, a senior software engineer, shares insights drawn from experience in scalable and secure technology development.

Saved by tldr-importer · Last saved October 29, 2025 · 1 min read

+ llm performance ✓ caching ✓ + ai architecture ✓