Region-Constrained Group Relative Policy Optimization for Flow-Based Image Editing

Zhuohan Ouyang; Zhe Qian; Wenhuo Cui; Chaoqun Wang

arXiv:2604.09386·cs.CV·April 13, 2026

Region-Constrained Group Relative Policy Optimization for Flow-Based Image Editing

Zhuohan Ouyang, Zhe Qian, Wenhuo Cui, Chaoqun Wang

PDF

TL;DR

This paper introduces RC-GRPO-Editing, a region-constrained framework for flow-based image editing that improves instruction adherence and content preservation by reducing background noise and aligning attention.

Contribution

It proposes a novel region-constrained post-training method that localizes exploration and aligns attention to enhance editing precision in flow-based models.

Findings

01

Improves editing region instruction adherence.

02

Enhances non-target content preservation.

03

Reduces reward variance and stabilizes advantages.

Abstract

Instruction-guided image editing requires balancing target modification with non-target preservation. Recently, flow-based models have emerged as a strong and increasingly adopted backbone for instruction-guided image editing, thanks to their high fidelity and efficient deterministic ODE sampling. Building on this foundation, GRPO-based reward-driven post-training has been explored to directly optimize editing-specific rewards, improving instruction following and editing consistency. However, existing methods often suffer from noisy credit assignment: global exploration also perturbs non-target regions, inflating within-group reward variance and yielding noisy GRPO advantages. To address this, we propose RC-GRPO-Editing, a region-constrained GRPO post-training framework for flow-based image editing under deterministic ODE sampling. It suppresses background-induced nuisance variance to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.