
# reinforcement-learning → visual-reasoning

2 links tagged with all of: reinforcement-learning + visual-reasoning


Links

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

This paper introduces a self-improvement method for visual reasoning that minimizes the number of training samples needed. The authors use Monte Carlo Tree Search (MCTS) to quantify sample difficulty and filter a large dataset down to 11k challenging samples. Training on these samples yields significant improvements for their model, ThinkLite-VL, over existing models. Evaluation results show a 7% increase in average performance and state-of-the-art accuracy on several benchmarks.

Saved by tldr-importer · Last saved October 29, 2025 · 2 min read

visual-reasoning · monte-carlo-tree-search · data-efficiency · reinforcement-learning · self-improvement

Thyme: Think Beyond Images

Thyme autonomously generates and executes code to process images for complex visual reasoning tasks. It uses a two-stage training strategy that combines supervised fine-tuning and reinforcement learning, along with the GRPO-ATS algorithm, to improve performance in high-resolution perception.

Saved by tldr-importer · Last saved October 29, 2025 · 1 min read

thyme · image-processing · visual-reasoning · machine-learning · reinforcement-learning