ELF-VC: Efficient Learned Flexible-Rate Video Coding
Oren Rippel, Alexander G. Anderson, Kedar Tatwawadi, Sanjay Nair,, Craig Lytle, Lubomir Bourdev

TL;DR
ELF-VC introduces a novel learned video coding method that achieves superior compression efficiency and speed compared to traditional standards and existing ML codecs, with a flexible-rate framework and optimized architecture.
Contribution
The paper presents a flexible-rate, efficient backbone, and a new in-loop flow prediction scheme for learned video compression, significantly improving performance and computational efficiency.
Findings
Reduces BD-rate by up to 44% against H.264 on UVG.
Runs at least 5x faster than comparable ML codecs.
Achieves real-time encoding/decoding at various resolutions.
Abstract
While learned video codecs have demonstrated great promise, they have yet to achieve sufficient efficiency for practical deployment. In this work, we propose several novel ideas for learned video compression which allow for improved performance for the low-latency mode (I- and P-frames only) along with a considerable increase in computational efficiency. In this setting, for natural videos our approach compares favorably across the entire R-D curve under metrics PSNR, MS-SSIM and VMAF against all mainstream video standards (H.264, H.265, AV1) and all ML codecs. At the same time, our approach runs at least 5x faster and has fewer parameters than all ML codecs which report these figures. Our contributions include a flexible-rate framework allowing a single model to cover a large and dense range of bitrates, at a negligible increase in computation and parameter count; an efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
