Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework

Tianyi Liu; Kejun Wu; Chen Cai; Yi Wang; Kim-Hui Yap; Lap-Pui Chau

arXiv:2507.22481·eess.IV·October 30, 2025

Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework

Tianyi Liu, Kejun Wu, Chen Cai, Yi Wang, Kim-Hui Yap, Lap-Pui Chau

PDF

TL;DR

This paper introduces a novel blind video recovery framework that leverages visual foundation models and corruption-aware modules to effectively restore videos corrupted at the bitstream level without manual annotations.

Contribution

It presents the first blind recovery approach integrating visual foundation models with corruption-aware modules, eliminating the need for manual corrupted region annotations.

Findings

01

Achieves superior recovery performance on corrupted videos.

02

Effectively localizes corruption without manual masks.

03

Enhances residual information processing for better restoration.

Abstract

Video signals are vulnerable in multimedia communication and storage systems, as even slight bitstream-domain corruption can lead to significant pixel-domain degradation. To recover faithful spatio-temporal content from corrupted inputs, bitstream-corrupted video recovery has recently emerged as a challenging and understudied task. However, existing methods require time-consuming and labor-intensive annotation of corrupted regions for each corrupted video frame, resulting in a large workload in practice. In addition, high-quality recovery remains difficult as part of the local residual information in corrupted frames may mislead feature completion and successive content recovery. In this paper, we propose the first blind bitstream-corrupted video recovery framework that integrates visual foundation models with a recovery model, which is adapted to different types of corruption and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.