High Visual-Fidelity Learned Video Compression
Meng Li, Yibo Shi, Jing Wang, Yunqi Huang

TL;DR
This paper introduces HVFVC, a learned video compression framework that enhances perceptual visual quality by addressing reconstruction and artifact issues, outperforming traditional standards at lower bitrates.
Contribution
The paper proposes a confidence-based feature reconstruction and a periodic compensation loss to improve perceptual quality in learned video compression.
Findings
HVFVC achieves superior perceptual quality compared to VVC.
It requires only 50% of the bitrate used by traditional standards.
The method effectively reduces checkerboard artifacts and improves reconstruction in new regions.
Abstract
With the growing demand for video applications, many advanced learned video compression methods have been developed, outperforming traditional methods in terms of objective quality metrics such as PSNR. Existing methods primarily focus on objective quality but tend to overlook perceptual quality. Directly incorporating perceptual loss into a learned video compression framework is nontrivial and raises several perceptual quality issues that need to be addressed. In this paper, we investigated these issues in learned video compression and propose a novel High Visual-Fidelity Learned Video Compression framework (HVFVC). Specifically, we design a novel confidence-based feature reconstruction method to address the issue of poor reconstruction in newly-emerged regions, which significantly improves the visual quality of the reconstruction. Furthermore, we present a periodic compensation loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Video Coding and Compression Technologies
MethodsFocus
