End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression
M.Ak{\i}n Y{\i}lmaz, A.Murat Tekalp

TL;DR
This paper introduces a learned hierarchical bi-directional video codec that leverages end-to-end optimization and outperforms traditional codecs in rate-distortion efficiency, demonstrating significant improvements in PSNR and MS-SSIM.
Contribution
It proposes a novel learned hierarchical bi-directional video codec combining hierarchical motion prediction with end-to-end training, achieving state-of-the-art results.
Findings
Achieves the best reported R-D results for learned VC in PSNR and MS-SSIM.
Outperforms x265, SVT-HEVC, and HM in PSNR and MS-SSIM.
Shows performance gains from learned masking, flow-field subsampling, and temporal flow vector prediction.
Abstract
Conventional video compression (VC) methods are based on motion compensated transform coding, and the steps of motion estimation, mode and quantization parameter selection, and entropy coding are optimized individually due to the combinatorial nature of the end-to-end optimization problem. Learned VC allows end-to-end rate-distortion (R-D) optimized training of nonlinear transform, motion and entropy model simultaneously. Most works on learned VC consider end-to-end optimization of a sequential video codec based on R-D loss averaged over pairs of successive frames. It is well-known in conventional VC that hierarchical, bi-directional coding outperforms sequential compression because of its ability to use both past and future reference frames. This paper proposes a learned hierarchical bi-directional video codec (LHBDC) that combines the benefits of hierarchical motion-compensated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Advanced Data Compression Techniques · Advanced Image Processing Techniques
