5 links tagged with all of: infrastructure + optimization
Links
This article details how Uber Eats developed its semantic search system to improve order discovery and conversion rates. It covers the architecture, model training, and challenges faced while scaling the platform to handle diverse queries effectively.
Zoomer is Meta's platform for automated debugging and optimization of AI workloads, enhancing performance across training and inference processes. It delivers insights that reduce training times and improve query performance, addressing inefficiencies in GPU utilization. The tool generates thousands of performance reports daily for various AI applications.
Patreon faced challenges scaling its infrastructure for live events, which required cross-team collaboration to quantify capacity and optimize performance. By carefully analyzing and prioritizing app requests, the team reduced load and improved user experience while maintaining system reliability. A key lesson was that achieving scalability required optimizing both the client and the server.
Charlotte Qi discusses the challenges of serving large language models (LLMs) at Meta, focusing on the complexities of LLM inference and the need for efficient hardware and software solutions. She outlines the critical steps to optimize LLM serving, including fitting models to hardware, managing latency, and leveraging techniques like continuous batching and disaggregation to enhance performance.
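Continuous batching, one of the techniques mentioned above, lets new requests join a running batch as soon as a slot frees up, instead of waiting for the whole batch to finish. A minimal sketch of the idea, using a toy model where each request needs a fixed number of decode steps (the `Request` class and `serve` function are illustrative, not from any real serving framework):

```python
from dataclasses import dataclass
from collections import deque

@dataclass
class Request:
    id: int
    tokens_left: int  # decode steps this request still needs

def serve(requests, max_batch_size):
    """Run decode steps; new requests join as soon as a slot frees up."""
    waiting = deque(requests)
    active: list[Request] = []
    completions = []  # (request id, step at which it finished)
    step = 0
    while waiting or active:
        # Continuous batching: refill the batch on every step,
        # not only when the whole batch has drained.
        while waiting and len(active) < max_batch_size:
            active.append(waiting.popleft())
        step += 1
        for r in active:
            r.tokens_left -= 1  # one decode step for every active request
        for r in active:
            if r.tokens_left == 0:
                completions.append((r.id, step))
        active = [r for r in active if r.tokens_left > 0]
    return completions

reqs = [Request(0, 3), Request(1, 1), Request(2, 2), Request(3, 2)]
print(serve(reqs, max_batch_size=2))
# → [(1, 1), (0, 3), (2, 3), (3, 5)]
```

With static batching, request 2 would wait the full 3 steps for the first batch to drain; here it slips into request 1's slot after a single step, which is the latency win the talk attributes to continuous batching.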
Pinterest has enhanced its machine learning (ML) infrastructure by extending the capabilities of Ray beyond just training and inference. By addressing challenges such as slow data pipelines and inefficient compute usage, Pinterest implemented a Ray-native ML infrastructure that improves feature development, sampling, and labeling, leading to faster, more scalable ML iteration.