2 links tagged with all of: model-training + machine-learning + open-source
Links
INTELLECT-3 is a Mixture-of-Experts model with over 100 billion parameters, trained using a custom reinforcement learning framework. It outperforms larger models on math, code, and reasoning benchmarks. The training infrastructure and datasets are open-sourced for public use and research.
The SpecForge team, in partnership with industry leaders, has launched SpecBundle (Phase 1), a collection of production-ready EAGLE-3 model checkpoints aimed at enhancing speculative decoding in large language models. The release addresses the lack of accessible tools and high-quality draft models. Alongside it, SpecForge v0.2 introduces major usability upgrades and multi-backend support for improved performance.