AdaGen: Learning Adaptive Policy for Image Synthesis
Zanlin Ni, Yulin Wang, Yeguo Hua, Renping Zhou, Jiayi Guo, Jun Song, Bo Zheng, Gao Huang

TL;DR
AdaGen introduces a learnable, adaptive scheduling framework for iterative image synthesis, improving quality and efficiency across various generative models by using reinforcement learning and adversarial rewards.
Contribution
It proposes a novel, sample-adaptive scheduling method for image synthesis that outperforms static schedules and manual tuning, with a reinforcement learning approach and adversarial reward design.
Findings
Achieves better FID scores with lower inference costs
Demonstrates effectiveness across multiple generative paradigms
Provides flexible control over fidelity-diversity trade-off
Abstract
Recent advances in image synthesis have been propelled by powerful generative models, such as Masked Generative Transformers (MaskGIT), autoregressive models, diffusion models, and rectified flow models. A common principle behind their success is the decomposition of synthesis into multiple steps. However, this introduces a proliferation of step-specific parameters (e.g., noise level or temperature at each step). Existing approaches typically rely on manually-designed rules to manage this complexity, demanding expert knowledge and trial-and-error. Furthermore, these static schedules lack the flexibility to adapt to the unique characteristics of each sample, yielding sub-optimal performance. To address this issue, we present AdaGen, a general, learnable, and sample-adaptive framework for scheduling the iterative generation process. Specifically, we formulate the scheduling problem as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Music Technology and Sound Studies · Embedded Systems Design Techniques
