3 links tagged with all of: model-training + vision-language
Links
The official repository for the paper "Generate, but Verify" presents REVERSE, a model that reduces hallucinations in vision-language models through retrospective resampling. It provides installation instructions, model checkpoints, and evaluation guidelines, and acknowledges foundational resources from the LLaVA and Qwen series.
Vision-Zero is a framework that improves vision-language models (VLMs) through competitive visual games, without requiring human-labeled data. It achieves state-of-the-art performance on a range of reasoning tasks, demonstrating that self-play can improve model capabilities while significantly reducing training costs. The framework supports diverse data sources, including synthetic, chart-based, and real-world images, and is particularly effective on fine-grained visual reasoning tasks.
This entry describes a PyTorch codebase for training language and vision-language models, covering installation, model inference, and training setup. It details the required dependencies, configuration paths, and procedures for integrating new datasets and models, and explains how to use varying amounts of GPU resources for efficient training and evaluation.