Fine-Grained Motion Compression and Selective Temporal Fusion for Neural B-Frame Video Coding
Xihua Sheng, Peilin Chen, Meng Wang, Li Zhang, Shiqi Wang, Dapeng Oliver Wu

TL;DR
This paper introduces novel motion compression and temporal fusion techniques for neural B-frame video coding, significantly improving compression efficiency and outperforming existing codecs.
Contribution
It presents a fine-grained motion auto-encoder with adaptive quantization and an interactive entropy model, along with a selective temporal fusion method with implicit alignment.
Findings
Achieves approximately 10% BD-rate reduction over state-of-the-art neural B-frame codec.
Delivers comparable or better compression than H.266/VVC under random-access.
Demonstrates effectiveness through extensive experiments.
Abstract
With the remarkable progress in neural P-frame video coding, neural B-frame coding has recently emerged as a critical research direction. However, most existing neural B-frame codecs directly adopt P-frame coding tools without adequately addressing the unique challenges of B-frame compression, leading to suboptimal performance. To bridge this gap, we propose novel enhancements for motion compression and temporal fusion for neural B-frame coding. First, we design a fine-grained motion compression method. This method incorporates an interactive dual-branch motion auto-encoder with per-branch adaptive quantization steps, which enables fine-grained compression of bi-directional motion vectors while accommodating their asymmetric bitrate allocation and reconstruction quality requirements. Furthermore, this method involves an interactive motion entropy model that exploits correlations between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Generative Adversarial Networks and Image Synthesis
MethodsADaptive gradient method with the OPTimal convergence rate
