Video Compression Meets Video Generation: Latent Inter-Frame Pruning with Attention Recovery

Dennis Menn; Yuedong Yang; Bokun Wang; Xiwen Wei; Mustafa Munir; Feng Liang; Radu Marculescu; Chenfeng Xu; Diana Marculescu

arXiv:2603.05811·cs.CV·April 30, 2026

Video Compression Meets Video Generation: Latent Inter-Frame Pruning with Attention Recovery

Dennis Menn, Yuedong Yang, Bokun Wang, Xiwen Wei, Mustafa Munir, Feng Liang, Radu Marculescu, Chenfeng Xu, Diana Marculescu

PDF

1 Repo

TL;DR

This paper introduces LIPAR, a framework that reduces video generation latency by pruning redundant latent patches and recovering attention, significantly increasing throughput without quality loss.

Contribution

The paper presents a novel latent inter-frame pruning method with an attention recovery mechanism that enhances real-time video generation efficiency.

Findings

01

Increases video editing throughput by 1.53 times

02

Achieves an average of 19.3 FPS on NVIDIA RTX 4090

03

Maintains generation quality without additional training

Abstract

Current video generation models suffer from high computational latency, making real-time applications prohibitively costly. In this paper, we address this limitation by exploiting the temporal redundancy inherent in video latent patches. To this end, we propose the Latent Inter-frame Pruning with Attention Recovery (LIPAR) framework, which detects and skips recomputing duplicated latent patches. Additionally, we introduce a novel Attention Recovery mechanism that approximates the attention values of pruned tokens, thereby removing visual artifacts arising from naively applying the pruning method. Empirically, our method increases video editing throughput by $1.53 \times$ , achieving an average of 19.3 FPS on an NVIDIA RTX 4090 with the 1.3B Self-Forcing model (4-step denoising, FP16). The proposed method does not compromise generation quality and can be seamlessly integrated with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dennismenn/lipar
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.