Quit Emailing Yourself

# dataset → reinforcement-learning → multimodal → open-source

1 link tagged with all of: dataset + reinforcement-learning + multimodal + open-source

Click any tag below to further narrow down your results

Links

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Mini-o3 introduces an advanced system that enhances tool-based interactions for visual reasoning by supporting deep, multi-turn reasoning and achieving state-of-the-art performance on visual search tasks. The system utilizes a novel over-turn masking strategy to effectively manage response lengths during reinforcement learning, combined with a comprehensive dataset designed for exploratory reasoning. Open-source code and models are provided to facilitate reproducibility and further research.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

+ visual-search multimodal ✓ reinforcement-learning ✓ open-source ✓ dataset ✓