GenEvolve: Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation

Sixiang Chen; Zhaohu Xing; Tian Ye; Xinyu Geng; Yunlong Lin; Jianyu Lai; Xuanhua He; Fuxiang Zhai; Jialin Gao; Lei Zhu

arXiv:2605.21605·cs.CV·May 22, 2026

GenEvolve: Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation

Sixiang Chen, Zhaohu Xing, Tian Ye, Xinyu Geng, Yunlong Lin, Jianyu Lai, Xuanhua He, Fuxiang Zhai, Jialin Gao, Lei Zhu

PDF

2 Repos 1 Models 1 Datasets

TL;DR

GenEvolve is a self-evolving image generation framework that leverages tool-orchestrated trajectories and visual experience distillation to improve image quality and diversity across varied challenges.

Contribution

It introduces a novel self-evolving approach combining structured visual experience with on-policy self-distillation for enhanced image generation.

Findings

01

Achieves state-of-the-art performance on public benchmarks.

02

Substantial gains over strong baselines.

03

Effective use of structured visual experience for training.

Abstract

Open-ended image generation is no longer a simple prompt-to-image problem. High-quality generation often requires an agent to combine a model's internal generative ability with external resources. As requests become more diverse and demanding, we aim to develop a general image-generation agent that can self-evolve through trajectories and use tools more effectively across varied generation challenges. To this end, we propose GenEvolve, a self-evolving framework based on Tool-Orchestrated Visual Experience Distillation. In GenEvolve, each generation attempt is modeled as a tool-orchestrated trajectory, where the agent gathers evidence, selects references, invokes generation skills, and composes them into a prompt-reference program. Unlike existing agentic generation methods that mainly rely on image-level scalar rewards, GenEvolve compares multiple trajectories for the same request and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
MeiGen-AI/GenEvolve
model· 146 dl· ♡ 6
146 dl♡ 6

Datasets

MeiGen-AI/GenEvolve-Data-Bench
dataset· 8.1k dl
8.1k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.