TIR-Agent: Training an Explorative and Efficient Agent for Image Restoration

Yisheng Zhang; Guoli Jia; Haote Hu; Shanxu Zhao; Kaikai Zhao; Long Sun; Xinwei Long; Kai Tian; Che Jiang; Zhaoxiang Liu; Kai Wang; Shiguo Lian; Kaiyan Zhang; Bowen Zhou

arXiv:2603.27742·cs.CV·March 31, 2026

TIR-Agent: Training an Explorative and Efficient Agent for Image Restoration

Yisheng Zhang, Guoli Jia, Haote Hu, Shanxu Zhao, Kaikai Zhao, Long Sun, Xinwei Long, Kai Tian, Che Jiang, Zhaoxiang Liu, Kai Wang, Shiguo Lian, Kaiyan Zhang, Bowen Zhou

PDF

TL;DR

TIR-Agent introduces a trainable, policy-based approach for image restoration that improves efficiency and performance over existing heuristic methods by using supervised fine-tuning and reinforcement learning.

Contribution

It proposes a novel trainable image restoration agent with a two-stage training pipeline, including exploration strategies and adaptive rewards, to optimize tool calling policies.

Findings

01

Outperforms 12 baseline methods in diverse degradation scenarios.

02

Achieves over 2.5 times faster inference speed by reducing redundant tool calls.

03

Demonstrates effectiveness on both in-domain and out-of-domain image degradations.

Abstract

Vision-language agents that orchestrate specialized tools for image restoration (IR) have emerged as a promising method, yet most existing frameworks operate in a training-free manner. They rely on heuristic task scheduling and exhaustive tool traversal, resulting in sub-optimal restoration paths and prohibitive computational cost. We argue that the core bottleneck lies in the absence of a learned policy to make decision, as a vision-language model cannot efficiently handle degradation-aware task ordering and tool composition. To this end, we propose TIR-Agent, a trainable image restoration agent that performs a direct tool-calling policy through a two-stage training pipeline of supervised fine-tuning (SFT) followed by reinforcement learning (RL). Two key designs underpin effective RL training: (i) a random perturbation strategy applied to the SFT data, which broadens the policy's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.