Dilated convolutional neural network-based deep reference picture   generation for video compression

Haoyue Tian; Pan Gao; Ran Wei; Manoranjan Paul

arXiv:2202.05514·eess.IV·February 14, 2022

Dilated convolutional neural network-based deep reference picture generation for video compression

Haoyue Tian, Pan Gao, Ran Wei, Manoranjan Paul

PDF

Open Access 1 Repo

TL;DR

This paper introduces a dilated CNN-based deep reference picture generator that enhances video compression by creating more relevant reference frames, leading to significant bit savings in VVC.

Contribution

It presents a novel deep reference picture generator using dilated CNNs integrated into VVC to improve motion compensation and compression efficiency.

Findings

01

Achieves 9.7% average bit savings in VVC.

02

Demonstrates improved reference picture relevance.

03

Validates effectiveness through comprehensive experiments.

Abstract

Motion estimation and motion compensation are indispensable parts of inter prediction in video coding. Since the motion vector of objects is mostly in fractional pixel units, original reference pictures may not accurately provide a suitable reference for motion compensation. In this paper, we propose a deep reference picture generator which can create a picture that is more relevant to the current encoding frame, thereby further reducing temporal redundancy and improving video compression efficiency. Inspired by the recent progress of Convolutional Neural Network(CNN), this paper proposes to use a dilated CNN to build the generator. Moreover, we insert the generated deep picture into Versatile Video Coding(VVC) as a reference picture and perform a comprehensive set of experiments to evaluate the effectiveness of our network on the latest VVC Test Model VTM. The experimental results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luka0714/OurProject
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Video Coding and Compression Technologies · Advanced Vision and Imaging