Quit Emailing Yourself

# reinforcement-learning → vision-language

2 links tagged with all of: reinforcement-learning + vision-language

Click any tag below to further narrow down your results

Links

Thinking with Map

This article presents a new approach for predicting image locations on Earth by integrating map-based reasoning into large vision-language models. It develops a two-stage optimization method that combines reinforcement learning with test-time scaling to enhance prediction accuracy. The authors introduce MAPBench, a benchmark for evaluating geolocalization performance on real-world images.

Saved by tldr-importer · Last saved February 14, 2026 · 1 min read

+ geolocalization reinforcement-learning ✓ vision-language ✓ + maps + benchmarks

GitHub - wangqinsi1/Vision-Zero: This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.

Vision-Zero is a novel framework that enhances vision-language models (VLMs) through competitive visual games without requiring human-labeled data. It achieves state-of-the-art performance in various reasoning tasks, demonstrating that self-play can effectively improve model capabilities while significantly reducing training costs. The framework supports diverse datasets, including synthetic, chart-based, and real-world images, showcasing its versatility and effectiveness in fine-grained visual reasoning tasks.

Saved by tldr-importer · Last saved October 29, 2025 · 5 min read

vision-language ✓ + self-play reinforcement-learning ✓ + model-training + gamification