The study investigates the impact of instruction tuning on the confidence calibration of large language models (LLMs), finding that calibration degrades significantly after tuning. It proposes label smoothing during supervised fine-tuning as a promising way to mitigate the resulting overconfidence, and it also addresses the memory-consumption challenges that arise when computing the cross-entropy loss with smoothing.
Tags: language-models, confidence-calibration, label-smoothing, machine-learning, natural-language-processing
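
As context for the label-smoothing mitigation mentioned in the summary, below is a minimal PyTorch sketch of label-smoothed cross-entropy for next-token prediction. The function name, smoothing value, and shapes are illustrative assumptions, not the paper's implementation; note that PyTorch's built-in `torch.nn.functional.cross_entropy` also accepts a `label_smoothing` argument.

```python
import torch
import torch.nn.functional as F


def label_smoothed_ce(logits: torch.Tensor,
                      targets: torch.Tensor,
                      epsilon: float = 0.1,
                      ignore_index: int = -100) -> torch.Tensor:
    """Cross-entropy with uniform label smoothing (illustrative sketch).

    logits:  (N, vocab_size) unnormalized scores, N = batch * seq_len
    targets: (N,) gold token ids, with ignore_index marking masked positions
    epsilon: probability mass spread uniformly over the vocabulary
    """
    # Materializing the full (N, vocab_size) log-prob tensor is the main
    # memory cost alluded to in the summary.
    log_probs = F.log_softmax(logits, dim=-1)

    # Negative log-likelihood of the gold token (clamp so gather is valid
    # at masked positions; they are zeroed out below).
    nll = -log_probs.gather(-1, targets.clamp_min(0).unsqueeze(-1)).squeeze(-1)

    # Uniform component: average negative log-prob over the whole vocabulary.
    smooth = -log_probs.mean(dim=-1)

    loss = (1.0 - epsilon) * nll + epsilon * smooth

    # Exclude padding / prompt tokens from the loss.
    mask = targets.ne(ignore_index)
    return (loss * mask).sum() / mask.sum().clamp_min(1)
```

The smoothed target assigns probability `1 - epsilon` to the gold token and spreads `epsilon` uniformly over the vocabulary, which discourages the model from pushing all probability onto a single token during fine-tuning.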