The article explains how GPUs work, focusing on key performance factors: the compute and memory hierarchy, performance regimes, and strategies for optimization. It highlights the imbalance between compute throughput and memory bandwidth, using the NVIDIA A100 GPU as a case study, and discusses techniques such as kernel fusion and tiling to improve performance. It also covers how arithmetic intensity determines whether an operation is memory-bound or compute-bound.
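As a rough illustration of the memory-bound versus compute-bound distinction, the sketch below estimates the arithmetic intensity at which an A100 stops being limited by memory bandwidth, using a simple roofline model. The spec-sheet figures assumed here (312 TFLOP/s FP16 tensor-core peak and 1,555 GB/s HBM2 bandwidth for the 40 GB A100) are published NVIDIA numbers, not values taken from the article.

```python
# Roofline-style estimate of the arithmetic intensity (FLOPs per byte moved
# from HBM) at which an A100 transitions from memory-bound to compute-bound.
# Assumed spec-sheet figures for the 40 GB A100; the article's exact numbers
# may differ.

PEAK_FLOPS = 312e12       # FP16 tensor-core peak, FLOP/s
MEM_BANDWIDTH = 1.555e12  # HBM2 bandwidth, bytes/s

# Ridge point: below this intensity a kernel is memory-bound,
# above it the kernel is compute-bound.
ridge_point = PEAK_FLOPS / MEM_BANDWIDTH
print(f"ridge point ~ {ridge_point:.0f} FLOPs per byte")

def attainable_tflops(arithmetic_intensity: float) -> float:
    """Attainable throughput (TFLOP/s) under the simple roofline model."""
    return min(PEAK_FLOPS, arithmetic_intensity * MEM_BANDWIDTH) / 1e12

# Example: an elementwise op doing ~0.25 FLOPs per byte is deeply
# memory-bound, while a large matmul with hundreds of FLOPs per byte
# can approach the compute roof.
for ai in (0.25, 10, 200, 1000):
    print(f"AI={ai:>6}: ~{attainable_tflops(ai):.1f} TFLOP/s attainable")
```

Under these assumed figures the ridge point lands around 200 FLOPs per byte, which is why bandwidth-saving techniques like fusion and tiling matter so much for low-intensity operations.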