2 links tagged with all of: performance + latency + optimization
Click any tag below to further narrow down your results
Links
Jeff Dean outlines essential timing metrics for various computing tasks. The list includes latencies for cache references, memory accesses, and network communications, providing clear benchmarks for developers. Understanding these numbers helps optimize performance in software engineering.
The article discusses the transformation of a batch machine learning inference system into a real-time system to handle explosive user growth, achieving a 5.8x reduction in latency and maintaining over 99.9% reliability. Key optimizations included migrating to Redis for faster data access, compiling models to native C binaries, and implementing gRPC for improved data transmission. These changes enabled the system to serve millions of predictions quickly while capturing significant revenue that would have otherwise been lost.