SmolLM3 is a competitive 3B-parameter multilingual language model designed for efficient deployment. It outperforms comparable models while keeping a focus on long-context reasoning. The model combines architectural changes with a training methodology built on a three-stage data mixture, and it supports dual-mode reasoning, which lets users switch between reasoning and non-reasoning responses. The complete engineering blueprint is shared so that others can reproduce the model and understand what drives its performance.
smollm3
language-model
multilingual
reasoning
training-methodology