1 link tagged with all of: deepmind + ai + games + benchmarks
Click any tag below to further narrow down your results
Links
Google DeepMind is expanding its Kaggle Game Arena to include benchmarks for social deduction and risk management games like Werewolf and Poker. These additions aim to evaluate AI models on communication, negotiation, and decision-making under uncertainty. The updates also enhance the platform's role in assessing AI behavior in complex environments.