Multi-Scale Deformable Alignment and Content-Adaptive Inference for Flexible-Rate Bi-Directional Video Compression
M.Ak{\i}n Y{\i}lmaz, O.Ugur Ulas, A.Murat Tekalp

TL;DR
This paper introduces a flexible-rate bi-directional video compression method with adaptive motion compensation and content-aware inference, achieving state-of-the-art rate-distortion performance in learned video coding.
Contribution
It proposes a novel multi-scale deformable alignment and motion-content adaptive inference for improved end-to-end video compression.
Findings
Outperforms prior learned video coding methods in rate-distortion metrics.
Enables flexible rate operation with a single model using a gain unit.
Achieves state-of-the-art compression efficiency.
Abstract
The lack of ability to adapt the motion compensation model to video content is an important limitation of current end-to-end learned video compression models. This paper advances the state-of-the-art by proposing an adaptive motion-compensation model for end-to-end rate-distortion optimized hierarchical bi-directional video compression. In particular, we propose two novelties: i) a multi-scale deformable alignment scheme at the feature level combined with multi-scale conditional coding, ii) motion-content adaptive inference. In addition, we employ a gain unit, which enables a single model to operate at multiple rate-distortion operating points. We also exploit the gain unit to control bit allocation among intra-coded vs. bi-directionally coded frames by fine tuning corresponding models for truly flexible-rate learned video coding. Experimental results demonstrate state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Advanced Image Processing Techniques
