This article outlines Distribution-Aligned Sequence Distillation, a new pipeline that improves performance on reasoning tasks such as math and code generation while using minimal training data. It introduces models such as DASD-4B-Thinking and DASD-30B-A3B-Thinking-Preview, which outperform larger models on a range of benchmarks. The methodology combines temperature-scheduled learning with mixed-policy distillation.
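The summary names temperature-scheduled learning but gives no formula. As an illustration only, a simple linear annealing schedule for the distillation temperature might look like the sketch below; the function name, endpoints, and linear form are all hypothetical, not taken from the article:

```python
def temperature(step: int, total_steps: int,
                t_start: float = 1.0, t_end: float = 0.6) -> float:
    """Linearly anneal a distillation temperature from t_start to t_end.

    Hypothetical schedule for illustration: the article does not specify
    the actual form or values used in temperature-scheduled learning.
    """
    frac = min(step / total_steps, 1.0)  # clamp so temperature stops at t_end
    return t_start + (t_end - t_start) * frac
```

A schedule like this would soften the teacher distribution early in training and sharpen it later, but the actual mechanism in the pipeline may differ.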