2 links tagged with all of: benchmarks + ai + machine-learning + open-source
Links
GLM-5 is a new model designed for complex systems engineering and long-horizon tasks, with 744 billion parameters and improved training efficiency. It outperforms its predecessor, GLM-4.7, on various benchmarks and can generate professional documents directly from text.
Moonshot AI's Kimi K2 model outperforms GPT-4 on several benchmarks, notably in autonomous task execution and mathematical reasoning. Its MuonClip optimizer is presented as a way to improve AI training efficiency, which could shift the competitive landscape among major AI providers.