Accelerating Video Inverse Problem Solvers with Autoregressive Diffusion Models
Taesung Kwon, Jonghyun Park, Hyungjin Chung, Jong Chul Ye

TL;DR
This paper introduces AVIS, an autoregressive diffusion-based framework for real-time video inverse problems, significantly reducing latency and increasing throughput compared to previous methods.
Contribution
The paper proposes AVIS, a streaming autoregressive diffusion model that drastically reduces latency and boosts throughput for video inverse problems, enabling near real-time performance.
Findings
AVIS reduces initial latency from 114s to 4s.
AVIS increases throughput from 0.71 to 1.18 FPS.
AVIS Flash achieves 5.91 FPS on a single GPU.
Abstract
Diffusion models provide powerful priors for zero-shot video inverse problems, but their real-time deployment is hindered by two inefficiencies: high initial latency caused by holistic video restoration, and low throughput resulting from multiple VAE passes to enforce measurement consistency in pixel space. To overcome these limitations, we propose Autoregressive Video Inverse problem Solver (AVIS). The AVIS framework leverages autoregressive video diffusion models to restore videos in a streaming manner, naturally eliminating latency bottlenecks. Specifically, AVIS initializes reverse diffusion with a measurement-consistent estimate, reducing the required sampling steps. Compared to leading non-autoregressive solvers, AVIS drastically reduces initial latency from 114s to 4s and increases throughput from 0.71 to 1.18 FPS while achieving superior restoration quality. We further introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
