TL;DR
This paper introduces MOVI-Codec, a novel deep learning video compression method that eliminates the need for motion estimation, achieving superior performance on high-resolution videos compared to standard codecs like H.264, HEVC, and H.266.
Contribution
The paper presents a motionless video compression framework using displaced frame differences and a novel LSTM-UNet network, reducing computational complexity and improving compression efficiency.
Findings
MOVI-Codec outperforms H.264 low-delay P veryfast in MS-SSIM.
MOVI-Codec exceeds HEVC performance at the same setting.
MOVI-Codec surpasses H.266 (VVC) at higher bitrates on high-resolution videos.
Abstract
With the development of higher resolution contents and displays, its significant volume poses significant challenges to the goals of acquiring, transmitting, compressing, and displaying high-quality video content. In this paper, we propose a new deep learning video compression architecture that does not require motion estimation, which is the most expensive element of modern hybrid video compression codecs like H.264 and HEVC. Our framework exploits the regularities inherent to video motion, which we capture by using displaced frame differences as video representations to train the neural network. In addition, we propose a new space-time reconstruction network based on both an LSTM model and a UNet model, which we call LSTM-UNet. The new video compression framework has three components: a Displacement Calculation Unit (DCU), a Displacement Compression Network (DCN), and a Frame…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
