Links
1 link tagged with all of: benchmarks + data-efficiency + reasoning + distillation + models
This article introduces Distribution-Aligned Sequence Distillation (DASD), a pipeline for improving performance on reasoning tasks such as math and code generation using minimal training data. It presents models including DASD-4B-Thinking and DASD-30B-A3B-Thinking-Preview, which outperform larger models across a range of benchmarks. The methodology combines temperature-scheduled learning with mixed-policy distillation.
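The article's exact formulation isn't given in this summary. As a rough illustration only, here is a minimal sketch of what a temperature-scheduled distillation loss can look like: a standard softened-KL knowledge-distillation objective whose softmax temperature is annealed over training. The function names, the linear schedule, and the start/end temperatures are all assumptions for illustration, not details taken from the paper.

```python
import torch
import torch.nn.functional as F

def temperature_at(step: int, total_steps: int,
                   t_start: float = 4.0, t_end: float = 1.0) -> float:
    """Linearly anneal the softmax temperature from t_start to t_end.

    Illustrative schedule only; the article's actual schedule is not
    described in this summary.
    """
    frac = min(step / max(total_steps, 1), 1.0)
    return t_start + (t_end - t_start) * frac

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      temperature: float) -> torch.Tensor:
    """KL divergence between temperature-softened teacher and student
    distributions, scaled by T^2 so gradient magnitudes stay comparable
    across temperatures (the standard knowledge-distillation correction).
    """
    t = temperature
    student_log_probs = F.log_softmax(student_logits / t, dim=-1)
    teacher_probs = F.softmax(teacher_logits / t, dim=-1)
    return F.kl_div(student_log_probs, teacher_probs,
                    reduction="batchmean") * (t * t)

# Example: at step 500 of 10,000, the loss is computed with a still-high
# temperature, emphasizing the teacher's full output distribution early on.
t = temperature_at(step=500, total_steps=10_000)
loss = distillation_loss(torch.randn(8, 32_000), torch.randn(8, 32_000), t)
```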