Accelerating Video Inverse Problem Solvers with Autoregressive Diffusion Models

Taesung Kwon; Jonghyun Park; Hyungjin Chung; Jong Chul Ye

arXiv:2605.20624·cs.CV·May 21, 2026

Accelerating Video Inverse Problem Solvers with Autoregressive Diffusion Models

Taesung Kwon, Jonghyun Park, Hyungjin Chung, Jong Chul Ye

PDF

TL;DR

This paper introduces AVIS, an autoregressive diffusion-based framework for real-time video inverse problems, significantly reducing latency and increasing throughput compared to previous methods.

Contribution

The paper proposes AVIS, a streaming autoregressive diffusion model that drastically reduces latency and boosts throughput for video inverse problems, enabling near real-time performance.

Findings

01

AVIS reduces initial latency from 114s to 4s.

02

AVIS increases throughput from 0.71 to 1.18 FPS.

03

AVIS Flash achieves 5.91 FPS on a single GPU.

Abstract

Diffusion models provide powerful priors for zero-shot video inverse problems, but their real-time deployment is hindered by two inefficiencies: high initial latency caused by holistic video restoration, and low throughput resulting from multiple VAE passes to enforce measurement consistency in pixel space. To overcome these limitations, we propose Autoregressive Video Inverse problem Solver (AVIS). The AVIS framework leverages autoregressive video diffusion models to restore videos in a streaming manner, naturally eliminating latency bottlenecks. Specifically, AVIS initializes reverse diffusion with a measurement-consistent estimate, reducing the required sampling steps. Compared to leading non-autoregressive solvers, AVIS drastically reduces initial latency from 114s to 4s and increases throughput from 0.71 to 1.18 FPS while achieving superior restoration quality. We further introduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.