Progressive Image Restoration via Text-Conditioned Video Generation

Peng Kang; Xijun Wang; Yu Yuan

arXiv:2512.02273·cs.CV·December 3, 2025

Progressive Image Restoration via Text-Conditioned Video Generation

Peng Kang, Xijun Wang, Yu Yuan

PDF

Open Access

TL;DR

This paper adapts text-to-video models for progressive image restoration, enabling high-quality, temporally coherent enhancement of degraded images through synthetic datasets and prompting strategies.

Contribution

It introduces a novel approach to repurpose video generation models for image restoration, with scene-specific prompts and demonstrates zero-shot robustness on real-world data.

Findings

01

Improved perceptual metrics across restoration sequences

02

Effective spatial detail and illumination consistency

03

Strong zero-shot generalization to real-world scenarios

Abstract

Recent text-to-video models have demonstrated strong temporal generation capabilities, yet their potential for image restoration remains underexplored. In this work, we repurpose CogVideo for progressive visual restoration tasks by fine-tuning it to generate restoration trajectories rather than natural video motion. Specifically, we construct synthetic datasets for super-resolution, deblurring, and low-light enhancement, where each sample depicts a gradual transition from degraded to clean frames. Two prompting strategies are compared: a uniform text prompt shared across all samples, and a scene-specific prompting scheme generated via LLaVA multi-modal LLM and refined with ChatGPT. Our fine-tuned model learns to associate temporal progression with restoration quality, producing sequences that improve perceptual metrics such as PSNR, SSIM, and LPIPS across frames. Extensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Image Enhancement Techniques