Learning to Compress Videos without Computing Motion

Meixu Chen, Todd Goodall, Anjul Patney, and Alan C. Bovik

arXiv:2009.14110·eess.IV·April 6, 2022

TL;DR

This paper introduces MOVI-Codec, a deep learning video compression method that requires no motion estimation. Measured by MS-SSIM, it outperforms H.264 and HEVC under the low-delay P veryfast setting, and it surpasses H.266 (VVC) at higher bitrates on high-resolution videos.

Contribution

The paper presents a motionless video compression framework using displaced frame differences and a novel LSTM-UNet network, reducing computational complexity and improving compression efficiency.

Findings

01 MOVI-Codec outperforms H.264 (low-delay P, veryfast setting) in MS-SSIM.

02 MOVI-Codec also exceeds HEVC performance under the same low-delay P veryfast setting.

03 MOVI-Codec surpasses H.266 (VVC) at higher bitrates on high-resolution videos.

Abstract

As higher-resolution content and displays become widespread, the sheer volume of video data poses significant challenges to acquiring, transmitting, compressing, and displaying high-quality video content. In this paper, we propose a new deep learning video compression architecture that does not require motion estimation, which is the most expensive element of modern hybrid video compression codecs like H.264 and HEVC. Our framework exploits the regularities inherent to video motion, which we capture by using displaced frame differences as video representations to train the neural network. In addition, we propose a new space-time reconstruction network based on both an LSTM model and a UNet model, which we call LSTM-UNet. The new video compression framework has three components: a Displacement Calculation Unit (DCU), a Displacement Compression Network (DCN), and a Frame…
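To make the idea of displaced frame differences concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' DCU: the function name `displaced_frame_differences`, the choice of horizontal and vertical shifts, the shift sizes `(1, 3, 5)`, and the wrap-around border handling of `np.roll` are all hypothetical choices made for this example.

```python
import numpy as np

def displaced_frame_differences(prev, curr, shifts=(1, 3, 5)):
    """Differences between the current frame and shifted copies of the previous frame.

    Assumptions (not taken from the paper): shifts are purely horizontal or
    vertical, sizes come from `shifts`, and borders wrap around via np.roll.
    Returns a dict mapping (dy, dx) -> curr - shifted(prev).
    """
    prev = prev.astype(np.float32)
    curr = curr.astype(np.float32)

    # The zero-displacement entry is the ordinary frame difference.
    diffs = {(0, 0): curr - prev}

    for s in shifts:
        for dy, dx in ((-s, 0), (s, 0), (0, -s), (0, s)):
            # Shift the previous frame by (dy, dx) pixels along (rows, columns).
            shifted = np.roll(prev, shift=(dy, dx), axis=(0, 1))
            diffs[(dy, dx)] = curr - shifted

    return diffs


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    prev = rng.integers(0, 256, size=(8, 8)).astype(np.uint8)
    # Simulate a one-pixel rightward translation between frames.
    curr = np.roll(prev, shift=(0, 1), axis=(0, 1))
    diffs = displaced_frame_differences(prev, curr)
    # The difference whose displacement matches the true motion has the least energy.
    best = min(diffs, key=lambda k: float(np.abs(diffs[k]).sum()))
    print("lowest-energy displacement:", best)
```

No motion search is performed here: every difference is computed for a fixed set of displacements, and the set of difference maps as a whole carries the motion information. In this toy case the difference at the displacement matching the true translation is exactly zero, which hints at why such maps can stand in for explicit motion vectors as network input.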

Code & Models

Repositories

Meixu-Chen/MOVI-Codec (official)

Taxonomy

Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory