OARS: Process-Aware Online Alignment for Generative Real-World Image Super-Resolution
Shijie Zhao, Xuanyu Zhang, Bin Chen, Weiqi Li, Qunliang Xing, Kexin Zhang, Yan Wang, Junlin Li, Li Zhang, Jian Zhang, and Tianfan Xue

TL;DR
OARS introduces a process-aware online alignment framework for real-world image super-resolution, leveraging a new reward model to improve perceptual quality while preserving fidelity, outperforming previous methods on benchmarks.
Contribution
The paper presents OARS, a novel online alignment method with COMPASS, a MLLM-based reward, enabling adaptive, interpretable training for super-resolution models.
Findings
Achieves state-of-the-art results on Real-ISR benchmarks.
Improves perceptual quality while maintaining fidelity.
Demonstrates effectiveness through extensive experiments and user studies.
Abstract
Aligning generative real-world image super-resolution models with human visual preference is challenging due to the perception--fidelity trade-off and diverse, unknown degradations. Prior approaches rely on offline preference optimization and static metric aggregation, which are often non-interpretable and prone to pseudo-diversity under strong conditioning. We propose OARS, a process-aware online alignment framework built on COMPASS, a MLLM-based reward that evaluates the LR to SR transition by jointly modeling fidelity preservation and perceptual gain with an input-quality-adaptive trade-off. To train COMPASS, we curate COMPASS-20K spanning synthetic and real degradations, and introduce a three-stage perceptual annotation pipeline that yields calibrated, fine-grained training labels. Guided by COMPASS, OARS performs progressive online alignment from cold-start flow matching to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Image and Video Quality Assessment
