Quit Emailing Yourself

# language-models → floating-point → reproducibility → inference

1 link tagged with all of: language-models + floating-point + reproducibility + inference

Defeating Nondeterminism in LLM Inference

Achieving reproducibility in large language model (LLM) inference is challenging due to inherent nondeterminism, often attributed to floating-point non-associativity and concurrency issues. However, most kernels in LLMs do not require atomic adds, which are a common source of nondeterminism, suggesting that the causes of variability in outputs are more complex. The article explores these complexities and offers insights into obtaining truly reproducible results in LLM inference.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

+ nondeterminism reproducibility ✓ floating-point ✓ inference ✓ language-models ✓

Links

Defeating Nondeterminism in LLM Inference