TL;DR
BiECVC introduces a novel bidirectional video compression framework that enhances context modeling and gating, achieving state-of-the-art compression efficiency surpassing traditional codecs across standard datasets.
Contribution
The paper proposes BiECVC, a new learned BVC method with diversified local and non-local context modeling and adaptive gating, outperforming existing methods and surpassing VTM 13.2 in all tests.
Findings
Reduces bit-rate by up to 15.7% compared to VTM 13.2.
First learned video codec to outperform VTM 13.2 RA on all standard datasets.
Achieves state-of-the-art compression performance in learned video coding.
Abstract
Recent forward prediction-based learned video compression (LVC) methods have achieved impressive results, even surpassing VVC reference software VTM under the Low Delay B (LDB) configuration. In contrast, learned bidirectional video compression (BVC) remains underexplored and still lags behind its forward-only counterparts. This performance gap is mainly due to the limited ability to extract diverse and accurate contexts: most existing BVCs primarily exploit temporal motion while neglecting non-local correlations across frames. Moreover, they lack the adaptability to dynamically suppress harmful contexts arising from fast motion or occlusion. To tackle these challenges, we propose BiECVC, a BVC framework that incorporates diversified local and non-local context modeling along with adaptive context gating. For local context enhancement, BiECVC reuses high-quality features from lower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need · ADaptive gradient method with the OPTimal convergence rate
