Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Xiangyu Zhao; Peiyuan Zhang; Junming Lin; Tianhao Liang; Yuchen Duan; Shengyuan Ding; Changyao Tian; Yuhang Zang; Junchi Yan; Xue Yang

arXiv:2603.12247·cs.CV·March 13, 2026

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Xiangyu Zhao, Peiyuan Zhang, Junming Lin, Tianhao Liang, Yuchen Duan, Shengyuan Ding, Changyao Tian, Yuhang Zang, Junchi Yan, Xue Yang

PDF

Open Access

TL;DR

This paper introduces FIRM, a robust reward modeling framework for faithful image editing and generation, improving alignment with human judgment and reducing hallucinations in reinforcement learning-based image synthesis.

Contribution

The paper develops high-quality datasets, specialized reward models, and a novel reward strategy, advancing the fidelity and reliability of RL-guided image editing and generation.

Findings

01

FIRM models outperform existing metrics in alignment with human judgment.

02

FIRM reduces hallucinations and improves fidelity in image editing and generation.

03

The framework achieves state-of-the-art performance on benchmark tasks.

Abstract

Reinforcement learning (RL) has emerged as a promising paradigm for enhancing image editing and text-to-image (T2I) generation. However, current reward models, which act as critics during RL, often suffer from hallucinations and assign noisy scores, inherently misguiding the optimization process. In this paper, we present FIRM (Faithful Image Reward Modeling), a comprehensive framework that develops robust reward models to provide accurate and reliable guidance for faithful image generation and editing. First, we design tailored data curation pipelines to construct high-quality scoring datasets. Specifically, we evaluate editing using both execution and consistency, while generation is primarily assessed via instruction following. Using these pipelines, we collect the FIRM-Edit-370K and FIRM-Gen-293K datasets, and train specialized reward models (FIRM-Edit-8B and FIRM-Gen-8B) that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques · Aesthetic Perception and Analysis