1 link tagged with all of: model-training + machine-learning + open-source + reinforcement-learning
Click any tag below to further narrow down your results
Links
INTELLECT-3 is a Mixture-of-Experts model with over 100 billion parameters, trained using a custom reinforcement learning framework. It outperforms larger models across various benchmarks in math, code, and reasoning. The training infrastructure and datasets are open-sourced for public use and research.