2 links tagged with all of: reinforcement-learning + research
Click any tag below to further narrow down your results
Links
NitroGen is an open-source model designed for creating gaming agents that can learn from internet videos. It takes pixel input from games and predicts gamepad actions but currently has limitations, such as only processing the last frame and lacking long-term planning abilities. Users must provide their own game copies to run the model on Windows.
This article explores the dynamic work environment at MiniMax, focusing on the challenges and breakthroughs in their reinforcement learning models. Senior researcher Olive Song discusses the importance of real-time collaboration between developers and researchers, and the lessons learned from unexpected model behaviors.