Flow-OPD: On-Policy Distillation for Flow Matching Models

Zhen Fang; Wenxuan Huang; Yu Zeng; Yiming Zhao; Shuang Chen; Kaituo Feng; Yunlong Lin; Lin Chen; Zehui Chen; Shaosheng Cao; and Feng Zhao

arXiv:2605.08063 · cs.CV · May 20, 2026

TL;DR

Flow-OPD introduces a unified on-policy distillation framework for flow matching models, significantly improving multi-task alignment and image quality in text-to-image generation.

Contribution

It is the first to integrate on-policy distillation into flow matching models, enhancing multi-task alignment and mitigating reward hacking.

Findings

01 GenEval score increased from 63 to 92

02 OCR accuracy improved from 59 to 94

03 Overall performance improved by roughly 10 points over vanilla GRPO

Abstract

Existing Flow Matching (FM) text-to-image models suffer from two critical bottlenecks under multi-task alignment: the reward sparsity induced by scalar-valued rewards, and the gradient interference arising from jointly optimizing heterogeneous objectives, which together give rise to a 'seesaw effect' of competing metrics and pervasive reward hacking. Inspired by the success of On-Policy Distillation (OPD) in the large language model community, we propose Flow-OPD, the first unified post-training framework that integrates on-policy distillation into Flow Matching models. Flow-OPD adopts a two-stage alignment strategy: it first cultivates domain-specialized teacher models via single-reward GRPO fine-tuning, allowing each expert to reach its performance ceiling in isolation; it then establishes a robust initial policy through a Flow-based Cold-Start scheme and seamlessly consolidates…
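
The abstract says each teacher is trained with single-reward GRPO, and it attributes reward sparsity to scalar-valued rewards. As a rough illustration, the sketch below computes the group-relative advantages commonly used in GRPO: each sampled image receives one scalar reward, and that reward is normalized against the other samples drawn for the same prompt. The function name, the `eps` constant, and the reward values are hypothetical. The abstract does not confirm that Flow-OPD uses exactly this normalization.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize one prompt's group of scalar rewards (standard GRPO-style).

    Assumption: advantages are (reward - group mean) / (group std + eps).
    This is the common formulation, not necessarily the paper's exact variant.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population standard deviation over the group
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical scalar rewards for four images sampled from the same prompt.
rewards = [0.2, 0.9, 0.4, 0.5]
advantages = group_relative_advantages(rewards)

# When every sample in a group earns the same reward, all advantages are zero,
# so that group contributes no learning signal.
flat = group_relative_advantages([1.0, 1.0, 1.0])
```

Each sample contributes a single number per reward model, regardless of how many denoising steps produced it. That is one way to read the sparsity the abstract describes.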

Code & Models

Repositories

CostaliyA/Flow-OPD (GitHub)

Models

CostaliyA/Flow-OPD (Hugging Face model; 83 downloads, 1 like)
