Exploring Long- and Short-Range Temporal Information for Learned Video Compression
Huairui Wang, Zhenzhong Chen

TL;DR
This paper introduces a novel learned video compression method that leverages both long-range and short-range temporal information, using a temporal prior and progressive guided motion compensation to improve rate-distortion performance.
Contribution
It proposes a temporal prior for long-range information and a hierarchical, optical flow-guided motion compensation for short-range, enhancing learned video compression.
Findings
Outperforms state-of-the-art video compression methods in RD performance.
Effective exploitation of long- and short-range temporal information.
Code is publicly available for reproducibility.
Abstract
Learned video compression methods have gained a variety of interest in the video coding community since they have matched or even exceeded the rate-distortion (RD) performance of traditional video codecs. However, many current learning-based methods are dedicated to utilizing short-range temporal information, thus limiting their performance. In this paper, we focus on exploiting the unique characteristics of video content and further exploring temporal information to enhance compression performance. Specifically, for long-range temporal information exploitation, we propose temporal prior that can update continuously within the group of pictures (GOP) during inference. In that case temporal prior contains valuable temporal information of all decoded images within the current GOP. As for short-range temporal information, we propose a progressive guided motion compensation to achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Advanced Image and Video Retrieval Techniques
