AlphaVC: High-Performance and Efficient Learned Video Compression
Yibo Shi, Yunying Ge, Jing Wang, Jue Mao

TL;DR
AlphaVC is a novel learned video compression scheme that surpasses traditional standards in quality and speed by introducing key techniques like conditional-I-frames, pixel-to-feature motion prediction, and entropy skipping.
Contribution
It presents the first end-to-end AI video codec that exceeds VVC in PSNR and MSSSIM metrics while maintaining fast encoding and decoding speeds.
Findings
Outperforms VVC with -28.2% BD-rate in PSNR
Achieves -52.2% BD-rate in MSSSIM
Offers significantly faster encoding and decoding speeds
Abstract
Recently, learned video compression has drawn lots of attention and show a rapid development trend with promising results. However, the previous works still suffer from some criticial issues and have a performance gap with traditional compression standards in terms of widely used PSNR metric. In this paper, we propose several techniques to effectively improve the performance. First, to address the problem of accumulative error, we introduce a conditional-I-frame as the first frame in the GoP, which stabilizes the reconstructed quality and saves the bit-rate. Second, to efficiently improve the accuracy of inter prediction without increasing the complexity of decoder, we propose a pixel-to-feature motion prediction method at encoder side that helps us to obtain high-quality motion information. Third, we propose a probability-based entropy skipping method, which not only brings performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Advanced Vision and Imaging · Advanced Image Processing Techniques
MethodsTest
