SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models

Jiesong Lian; Ruizhe Zhong; Zixiang Zhou; Xiaoyue Mi; Long Hu; Yuan Zhou; Qinglin Lu; Yixue Hao; Junchi Yan

arXiv:2512.22170·cs.LG·March 17, 2026

SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models

Jiesong Lian, Ruizhe Zhong, Zixiang Zhou, Xiaoyue Mi, Long Hu, Yuan Zhou, Qinglin Lu, Yixue Hao, Junchi Yan

PDF

Open Access 1 Models

TL;DR

SoliReward introduces a comprehensive framework for training video reward models that reduces annotation noise, mitigates reward hacking, and improves alignment with human preferences through novel data sourcing, architectural innovations, and loss functions.

Contribution

It proposes a systematic approach combining high-quality data collection, a new attention mechanism, and a modified loss to enhance video reward model training and alignment.

Findings

01

Improves RM evaluation metrics on benchmarks.

02

Enhances post-training video generation quality.

03

Reduces susceptibility to reward hacking.

Abstract

Post-training alignment of video generation models with human preferences is a critical goal. Developing effective Reward Models (RMs) for this process faces significant methodological hurdles. Current data collection paradigms, reliant on in-prompt pairwise annotations, suffer from labeling noise. Concurrently, the architectural design of VLM-based RMs, particularly their output mechanisms, remains underexplored. Furthermore, RM is susceptible to reward hacking in post-training. To mitigate these limitations, we propose SoliReward, a systematic framework for video RM training. Our framework first sources high-quality, cost-efficient data via single-item binary annotations, then constructs preference pairs using a cross-prompt pairing strategy. Architecturally, we employ a Hierarchical Progressive Query Attention mechanism to enhance feature aggregation. Finally, we introduce a modified…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Yukino271828/SoliReward
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Human Pose and Action Recognition · Artificial Intelligence in Games