FAIL: Flow Matching Adversarial Imitation Learning for Image Generation

Yeyao Ma; Chen Li; Xiaosong Zhang; Han Hu; Weidi Xie

arXiv:2602.12155·cs.CV·February 13, 2026

FAIL: Flow Matching Adversarial Imitation Learning for Image Generation

Yeyao Ma, Chen Li, Xiaosong Zhang, Han Hu, Weidi Xie

PDF

Open Access 1 Datasets

TL;DR

FAIL introduces a novel adversarial imitation learning framework for image generation that aligns policies with expert demonstrations without explicit rewards, improving performance and robustness in various generative tasks.

Contribution

The paper presents FAIL, a new adversarial imitation learning method for image generation that avoids reward modeling and preference pairs, with algorithms suitable for different computational settings.

Findings

01

FAIL achieves competitive prompt following performance.

02

The framework generalizes to image and video generation.

03

Fails acts as a regularizer to reduce reward hacking.

Abstract

Post-training of flow matching models-aligning the output distribution with a high-quality target-is mathematically equivalent to imitation learning. While Supervised Fine-Tuning mimics expert demonstrations effectively, it cannot correct policy drift in unseen states. Preference optimization methods address this but require costly preference pairs or reward modeling. We propose Flow Matching Adversarial Imitation Learning (FAIL), which minimizes policy-expert divergence through adversarial training without explicit rewards or pairwise comparisons. We derive two algorithms: FAIL-PD exploits differentiable ODE solvers for low-variance pathwise gradients, while FAIL-PG provides a black-box alternative for discrete or computationally constrained settings. Fine-tuning FLUX with only 13,000 demonstrations from Nano Banana pro, FAIL achieves competitive performance on prompt following and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

HansPolo/FAIL-expert-data
dataset· 12 dl
12 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning