ShapeR offers a method for generating 3D shapes from image sequences. It processes input images to extract relevant data, then uses a transformer model to create a mesh representation of each object in the scene. The project includes tools for setup, data exploration, and evaluation.
This GitHub repository provides RBench, a benchmark for evaluating robotics video generation, and RoVid-X, a dataset for training models with RGB, depth, and optical flow videos. The authors highlight limitations in existing video models and aim to enhance embodied AI research.
Egocentric-10K is the largest dataset focused on egocentric video collected in real factory settings, featuring over 1 billion frames across nearly 193,000 clips. It includes detailed camera intrinsics and metadata for each video, making it valuable for research in human-robot interaction and computer vision.
This article introduces FinCDM, a framework for assessing financial large language models (LLMs) by evaluating their knowledge and skills rather than relying on a single score. It highlights the creation of a new dataset, CPA-KQA, based on CPA exam questions, which allows for a more nuanced analysis of LLM capabilities in financial contexts. The framework aims to uncover knowledge gaps and enhance model development for real-world applications.
The article presents Golden Goose, a method to create unlimited Reinforcement Learning with Verifiable Rewards (RLVR) tasks by using unverifiable internet text. It describes how the authors developed a large-scale dataset, GooseReason-0.7M, which includes over 700,000 tasks across various domains. The approach successfully enhances model performance, even in areas like cybersecurity where prior data was unavailable.
This article details how Datadog's teams used LLM Observability to enhance their natural language query (NLQ) agent for analyzing cloud costs. It covers the creation of a ground truth dataset, the challenges of evaluating AI-generated queries, and the implementation of a structured debugging process to identify and address errors.
This GitHub repository provides an open-source dataset of over 20,000 identified malicious software packages. It includes samples from npm, PyPI, and IDE extensions, along with tools for analysis. Users can look up whether specific package versions are known to be malicious, and should handle the samples with caution.
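A lookup against such a dataset can be as simple as an exact match on (ecosystem, name, version). The sketch below is illustrative only; the package names and index structure are hypothetical assumptions, not the repository's actual tooling.

```python
# Hypothetical sketch: checking a package version against a local index
# of known-malicious entries. Entries here are made up for illustration.
KNOWN_MALICIOUS = {
    ("npm", "evil-pkg", "1.0.0"),
    ("pypi", "totally-safe-lib", "0.2.1"),
}

def is_flagged(ecosystem: str, name: str, version: str) -> bool:
    """Return True if this exact package version appears in the index."""
    return (ecosystem.lower(), name.lower(), version) in KNOWN_MALICIOUS

print(is_flagged("NPM", "evil-pkg", "1.0.0"))      # True: flagged version
print(is_flagged("pypi", "requests", "2.31.0"))    # False: not in the index
```

Note that exact-version matching is deliberate: a package is often benign in most releases and malicious in only one compromised version.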
DeepMath-103K is a newly released dataset designed to enhance mathematical reasoning in language models, featuring a broad range of challenging and diverse math problems. It includes rigorous decontamination processes to ensure fair evaluation, with detailed problem structures that support various research applications. The accompanying models and code are open-sourced to facilitate further exploration and development in the field.
This article introduces IMAGGarment-1, a framework for generating garments with detailed control over silhouette, color, and logo placement. It features a two-stage training process that separates global appearance from local details, allowing for precise customization in fashion design. The authors also present GarmentBench, a large dataset of nearly 190,000 garment samples with various design conditions, to support their model.
The article discusses a webinar focused on the world's largest multimodal dataset for artificial intelligence, highlighting its significance in advancing AI research and applications. It features insights from experts on the dataset's capabilities and potential impact across various industries.
Web Bench introduces a new dataset for evaluating AI browser agents, consisting of 5,750 tasks across 452 websites. The dataset aims to address limitations in existing benchmarks by focusing on both read and write tasks, revealing that agents struggle significantly with write-heavy tasks like form filling and authentication, while performing better on read tasks. Skyvern 2.0 currently leads in performance for write tasks, highlighting opportunities for improvement in AI browser capabilities.
MedReason is a comprehensive medical reasoning dataset that enhances large language models (LLMs) by utilizing a structured medical knowledge graph to create detailed reasoning paths from clinical question-answer pairs. The dataset includes 32,682 QA pairs with step-by-step explanations, and the MedReason-8B model, fine-tuned on this data, achieves state-of-the-art performance in medical reasoning tasks. The project is open-sourced, providing access to models, data, and deployment codes for further research and applications.
OmniSVG is a unified framework for generating high-quality scalable vector graphics (SVG) using pre-trained Vision-Language Models (VLMs), which decouples structural logic from low-level geometry. It introduces the MMSVG-2M dataset with two million annotated SVG assets and supports multiple generation modalities, demonstrating superior performance over existing methods for diverse creative tasks. The model is designed to handle complexity ranging from simple icons to intricate illustrations, offering flexibility for professional design workflows.
StreamBridge is a framework designed to convert offline Video Large Language Models (Video-LLMs) into proactive streaming assistants, addressing issues of multi-turn understanding and proactive response mechanisms. It utilizes a memory buffer and a lightweight activation model for continuous engagement, alongside the creation of the Stream-IT dataset for enhanced streaming video comprehension. Experiments demonstrate that StreamBridge outperforms existing models, showcasing significant improvements in video understanding tasks.
VistaDPO is a new framework for optimizing video understanding in Large Video Models (LVMs) by aligning text-video preferences at three hierarchical levels: instance, temporal, and perceptive. The authors introduce a dataset, VistaDPO-7k, consisting of 7.2K annotated QA pairs to address the challenges of video-language misalignment and hallucinations, showing significant performance improvements in various benchmarks.
MS MARCO Web Search is a comprehensive dataset designed for information retrieval research, featuring millions of real clicked query-document labels and a vast corpus from ClueWeb22. It supports various tasks in machine learning and retrieval systems, offering a benchmark for evaluating retrieval methods and performance across large datasets. Researchers can utilize this dataset to investigate the effectiveness of their techniques on both small and large data scales.
Access to the Institutional Books dataset requires users to agree to specific terms of use, emphasizing noncommercial use, no redistribution, and proper attribution. The dataset consists of 983,004 public domain books digitized by Harvard Library, aimed at supporting research and public-interest purposes while encouraging feedback for ongoing refinements. Users can create derivative works for noncommercial purposes but must adhere to the outlined guidelines and limitations of liability.
OpenAI MRCR (Multi-round co-reference resolution) is a long context dataset designed to evaluate a language model's ability to identify multiple instances of similar requests embedded in a conversation. This dataset incorporates varying levels of complexity by including multiple identical asks within long, multi-turn dialogues, challenging the model to accurately differentiate and respond to specific instances. Implementation details and grading methods for assessing model performance are also provided.
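The core difficulty is retrieval of a specific instance among near-duplicates: when the same ask appears several times in a long dialogue, the model must reproduce the answer tied to, say, the second occurrence. The toy sketch below illustrates that setup with a similarity-based grade; the conversation layout and reference answers are illustrative assumptions, not the dataset's exact harness.

```python
from difflib import SequenceMatcher

def grade(response: str, answer: str) -> float:
    """Continuous grade in [0, 1]: string similarity between the model's
    response and the reference answer for the targeted instance."""
    return SequenceMatcher(None, response, answer).ratio()

# The dialogue interleaves distractor turns with n identical asks; the
# model is then told to reproduce e.g. the 2nd instance, not the 3rd.
asks = ["write a poem about penguins"] * 3
answers = ["poem A", "poem B", "poem C"]   # hypothetical references
target = 1                                  # ask for the 2nd instance

print(round(grade("poem B", answers[target]), 2))  # exact match -> 1.0
print(round(grade("poem C", answers[target]), 2))  # wrong instance scores lower
```

Grading against the instance-specific reference is what makes the task hard: a model that merely recognizes the topic, but not which occurrence was requested, still scores poorly.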
REverse-Engineered Reasoning (REER) introduces a novel approach to instilling deep reasoning in language models by working backwards from known solutions to discover the underlying reasoning process. This method addresses the limitations of traditional reinforcement learning and instruction distillation, resulting in the creation of a large dataset, DeepWriting-20K, and a model, DeepWriter-8B, that outperforms existing models in open-ended tasks. The research emphasizes the importance of structured reasoning and iterative refinement in generating high-quality outputs.
Mini-o3 introduces an advanced system that enhances tool-based interactions for visual reasoning by supporting deep, multi-turn reasoning and achieving state-of-the-art performance on visual search tasks. The system utilizes a novel over-turn masking strategy to effectively manage response lengths during reinforcement learning, combined with a comprehensive dataset designed for exploratory reasoning. Open-source code and models are provided to facilitate reproducibility and further research.
REGEN introduces a new benchmark dataset aimed at enhancing the capabilities of large language models (LLMs) in generating personalized recommendations through natural language interactions. By augmenting the Amazon Product Reviews dataset with user critiques and contextual narratives, REGEN allows for more nuanced conversational recommendations that adapt to user feedback. The study demonstrates how models like LUMEN can effectively integrate recommendation and narrative generation, paving the way for more intuitive user experiences.
Weak-to-Strong Decoding (WSD) is a novel framework designed to enhance the alignment capabilities of large language models (LLMs) by utilizing a smaller aligned model to guide the initial drafting of responses. By integrating a well-aligned draft model, WSD significantly improves the quality of generated content while minimizing the alignment tax, as demonstrated through extensive experiments and the introduction of the GenerAlign dataset. The framework provides a structured approach for researchers to develop safe AI systems while navigating the complexities of preference alignment.
Migrating from DataFrame to Dataset in Apache Spark can significantly reduce runtime errors thanks to type safety, compile-time checks, and clearer schema awareness. This transition addresses common issues such as human errors and schema mismatches, ultimately leading to more robust and maintainable data processing systems. The article provides insights into the advantages of using Dataset over DataFrame for large-scale data processing, emphasizing correctness and maintainability.
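Spark's typed Dataset API is specific to Scala/Java, but the failure mode the article describes is language-agnostic: untyped access blows up mid-job when a bad row is touched, while a declared schema rejects malformed rows at the boundary. The Python sketch below is an analogy using dataclasses, not Spark code, and the record shape is hypothetical.

```python
from dataclasses import dataclass

# One row has a typo'd field name ("amnt"), a classic schema mismatch.
raw = [{"user_id": 1, "amount": 9.99}, {"user_id": 2, "amnt": 5.00}]

# Untyped, DataFrame-style access: the mismatch surfaces only when the
# bad row is reached, deep inside the computation.
def total_untyped(rows):
    return sum(r["amount"] for r in rows)   # KeyError at runtime, mid-job

# Typed, Dataset-style access: converting to a declared schema up front
# rejects malformed rows at ingest instead.
@dataclass
class Purchase:
    user_id: int
    amount: float

def parse(rows):
    return [Purchase(**r) for r in rows]    # TypeError raised here, at the boundary

try:
    total_untyped(raw)
except KeyError as e:
    print("failed mid-computation:", e)

try:
    parse(raw)
except TypeError:
    print("rejected at the schema boundary")
```

In Scala the second failure would be a compile-time error rather than an ingest-time exception, which is precisely the correctness gain the migration targets.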
EleutherAI has released the Common Pile v0.1, an 8 TB dataset of openly licensed and public domain text for training large language models, marking a significant advancement from its predecessor, the Pile. The initiative emphasizes the importance of transparency and openness in AI research, aiming to provide researchers with essential tools and a shared corpus for better collaboration and accountability in the field. Future collaborations with cultural heritage institutions are planned to enhance the quality and accessibility of public domain works.
The article introduces the Pico-Banana-400K dataset, a large-scale collection of approximately 400,000 text-image-edit triplets for text-guided image editing. It addresses limitations in existing datasets by providing high-quality, diverse edit pairs generated from real photographs, spanning a variety of edit operations across multiple semantic categories, with edit quality evaluated using advanced AI models. Specialized subsets support multi-turn editing, preference research, and instruction summarization.