DVC: An End-to-end Deep Video Compression Framework

Guo Lu; Wanli Ouyang; Dong Xu; Xiaoyun Zhang; Chunlei Cai; Zhiyong Gao

arXiv:1812.00101·eess.IV·April 9, 2019·6 cites

DVC: An End-to-end Deep Video Compression Framework

Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, Chunlei Cai, Zhiyong Gao

PDF

Open Access 4 Repos

TL;DR

This paper introduces DVC, an end-to-end deep learning framework for video compression that jointly optimizes motion estimation, residual encoding, and reconstruction, outperforming traditional standards like H.264 and rivaling H.265.

Contribution

It presents the first fully end-to-end neural network-based video compression model that integrates all components into a single trainable system.

Findings

01

Outperforms H.264 in PSNR

02

Comparable to H.265 in MS-SSIM

03

Joint optimization improves compression efficiency

Abstract

Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional video compression method and the powerful non-linear representation ability of neural networks, we propose the first end-to-end video compression deep model that jointly optimizes all the components for video compression. Specifically, learning based optical flow estimation is utilized to obtain the motion information and reconstruct the current frames. Then we employ two auto-encoder style neural networks to compress the corresponding motion and residual information. All the modules are jointly learned through a single loss function, in which they collaborate with each other by considering the trade-off between reducing the number of compression bits…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Video Coding and Compression Technologies · Advanced Image Processing Techniques