HP-Edit: A Human-Preference Post-Training Framework for Image Editing

Fan Li; Chonghuinan Wang; Lina Lei; Yuping Qiu; Jiaqi Xu; Jiaxiu Jiang; Xinran Qin; Zhikai Chen; Fenglong Song; Zhixin Wang; Renjing Pei; Wangmeng Zuo

arXiv:2604.19406·cs.CV·April 22, 2026

HP-Edit: A Human-Preference Post-Training Framework for Image Editing

Fan Li, Chonghuinan Wang, Lina Lei, Yuping Qiu, Jiaqi Xu, Jiaxiu Jiang, Xinran Qin, Zhikai Chen, Fenglong Song, Zhixin Wang, Renjing Pei, Wangmeng Zuo

PDF

TL;DR

HP-Edit is a post-training framework that uses human preferences and a new dataset to improve image editing models' alignment with human expectations.

Contribution

It introduces HP-Edit, a novel framework utilizing human preference data and a pretrained VLM to enhance diffusion-based image editing.

Findings

01

Significant improvement in human preference alignment of image editing models.

02

Development of RealPref-50K, a large-scale human preference dataset.

03

Introduction of RealPref-Bench for evaluating real-world editing performance.

Abstract

Common image editing tasks typically adopt powerful generative diffusion models as the leading paradigm for real-world content editing. Meanwhile, although reinforcement learning (RL) methods such as Diffusion-DPO and Flow-GRPO have further improved generation quality, efficiently applying Reinforcement Learning from Human Feedback (RLHF) to diffusion-based editing remains largely unexplored, due to a lack of scalable human-preference datasets and frameworks tailored to diverse editing needs. To fill this gap, we propose HP-Edit, a post-training framework for Human Preference-aligned Editing, and introduce RealPref-50K, a real-world dataset across eight common tasks and balancing common object editing. Specifically, HP-Edit leverages a small amount of human-preference scoring data and a pretrained visual large language model (VLM) to develop HP-Scorer--an automatic, human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.