CoCoEdit: Content-Consistent Image Editing via Region Regularized Reinforcement Learning

Yuhui Wu; Chenxi Xie; Ruibin Li; Liyi Chen; Qiaosi Yi; Lei Zhang

arXiv:2602.14068·cs.CV·May 15, 2026

CoCoEdit: Content-Consistent Image Editing via Region Regularized Reinforcement Learning

Yuhui Wu, Chenxi Xie, Ruibin Li, Liyi Chen, Qiaosi Yi, Lei Zhang

PDF

1 Datasets

TL;DR

CoCoEdit is a reinforcement learning framework that enhances image editing by ensuring content consistency and precise region control, outperforming existing models in quality and stability.

Contribution

The paper introduces a novel region regularized reinforcement learning approach with pixel-level rewards and dataset augmentation for content-consistent image editing.

Findings

01

Achieves competitive editing scores with state-of-the-art models.

02

Significantly improves content consistency as measured by PSNR/SSIM.

03

Outperforms existing models in human subjective ratings.

Abstract

Image editing has achieved impressive results with the development of large-scale generative models. However, existing models mainly focus on the editing effects of intended objects and regions, often leading to unwanted changes in unintended regions. We present a post-training framework for Content-Consistent Editing (CoCoEdit) via region regularized reinforcement learning. We first augment existing editing datasets with refined instructions and masks, from which 40K diverse and high quality samples are curated as training set. We then introduce a pixel-level similarity reward to complement MLLM-based rewards, enabling models to ensure both editing quality and content consistency during the editing process. To overcome the spatial-agnostic nature of the rewards, we propose a region-based regularizer, aiming to preserve non-edited regions for high-reward samples while encouraging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

wyh6666/GEditBench_ImgEditBench_with_mask
dataset· 40 dl
40 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.