Restore-R1: Efficient Image Restoration Agents via Reinforcement Learning with Multimodal LLM Perceptual Feedback

Jianglin Lu; Yuanwei Wu; Ziyi Zhao; Hongcheng Wang; Felix Jimenez; Abrar Majeedi; Yun Fu

arXiv:2512.18599·cs.CV·April 7, 2026

Restore-R1: Efficient Image Restoration Agents via Reinforcement Learning with Multimodal LLM Perceptual Feedback

Jianglin Lu, Yuanwei Wu, Ziyi Zhao, Hongcheng Wang, Felix Jimenez, Abrar Majeedi, Yun Fu

PDF

TL;DR

This paper introduces a reinforcement learning-based image restoration agent that efficiently determines restoration steps using multimodal LLM feedback, achieving state-of-the-art results without supervision.

Contribution

It proposes a label-free, policy optimization framework with a multimodal LLM reward mechanism, enabling efficient and effective image restoration without extensive annotations.

Findings

01

Matches SOTA on full-reference metrics without supervision

02

Outperforms existing methods on no-reference metrics

03

Significantly accelerates inference by reducing redundant tool calls

Abstract

Complex image restoration aims to recover high-quality images from inputs affected by multiple degradations such as blur, noise, rain, and compression artifacts. Recent restoration agents, powered by vision-language models and large language models, offer promising restoration capabilities but suffer from significant efficiency bottlenecks due to reflection, rollback, and iterative tool searching. Moreover, their performance heavily depends on degradation recognition models that require extensive annotations for training, limiting their applicability in label-free environments. To address these limitations, we propose a policy optimization-based restoration framework that learns an lightweight agent to determine tool-calling sequences. The agent operates in a sequential decision process, selecting the most appropriate restoration operation at each step to maximize final image quality.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.