Reducing Complexity of HEVC: A Deep Learning Approach

Mai Xu; Tianyi Li; Zulin Wang; Xin Deng; Ren Yang; Zhenyu Guan

arXiv:1710.01218·cs.CV·March 6, 2019

TL;DR

This paper introduces a deep learning method using CNN and LSTM to predict CU partitions in HEVC, significantly reducing encoding complexity while maintaining performance.

Contribution

It presents a novel hierarchical deep learning framework with ETH-CNN and ETH-LSTM for efficient CU partition prediction in HEVC.

Findings

01

Reduces HEVC encoding complexity by up to 50%.

02

Outperforms state-of-the-art complexity reduction methods.

03

Maintains comparable video quality with less computation.

Abstract

High Efficiency Video Coding (HEVC) significantly reduces bit-rates over the preceding H.264 standard, but at the expense of extremely high encoding complexity. In HEVC, the quad-tree partition of the coding unit (CU) consumes a large proportion of the encoding complexity, due to the brute-force search for rate-distortion optimization (RDO). Therefore, this paper proposes a deep learning approach to predict the CU partition for reducing HEVC complexity in both intra- and inter-modes, based on a convolutional neural network (CNN) and a long short-term memory (LSTM) network. First, we establish a large-scale database including substantial CU partition data for HEVC intra- and inter-modes. This enables deep learning on the CU partition. Second, we represent the CU partition of an entire coding tree unit (CTU) in the form of a hierarchical CU partition map (HCPM). Then, we…
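
To make the quad-tree search space and the HCPM idea concrete, here is a minimal Python sketch. It is not the authors' code. It assumes a 64×64 CTU with a minimum CU size of 8×8, and it assumes the HCPM is laid out as three levels of binary split flags (1 flag for the 64×64 CTU, 4 flags for the 32×32 CUs, 16 flags for the 16×16 CUs); the abstract above is truncated before it spells out this layout. The function names `count_partitions` and `decode_hcpm` are hypothetical.

```python
from typing import List, Tuple

CTU_SIZE = 64  # assumed CTU size in luma samples
MIN_CU = 8     # assumed smallest CU size


def count_partitions(size: int = CTU_SIZE) -> int:
    """Count the distinct quad-tree partitions of a size x size block.

    A block is either left unsplit (1 way) or split into four
    sub-blocks, each of which is partitioned independently.
    """
    if size == MIN_CU:
        return 1
    return 1 + count_partitions(size // 2) ** 4


def decode_hcpm(
    l1: int, l2: List[int], l3: List[List[int]]
) -> List[Tuple[int, int, int]]:
    """Turn three levels of split flags into a list of (x, y, size) CUs.

    l1       : split flag for the 64x64 CTU
    l2[i]    : split flag for the i-th 32x32 CU (raster order)
    l3[i][j] : split flag for the j-th 16x16 CU inside the i-th 32x32 CU
    Flags below an unsplit parent are ignored (hypothetical layout).
    """
    if not l1:
        return [(0, 0, 64)]
    cus: List[Tuple[int, int, int]] = []
    for i in range(4):
        x32, y32 = (i % 2) * 32, (i // 2) * 32
        if not l2[i]:
            cus.append((x32, y32, 32))
            continue
        for j in range(4):
            x16, y16 = x32 + (j % 2) * 16, y32 + (j // 2) * 16
            if not l3[i][j]:
                cus.append((x16, y16, 16))
            else:
                # A split 16x16 CU yields four 8x8 CUs.
                for k in range(4):
                    cus.append((x16 + (k % 2) * 8, y16 + (k // 2) * 8, 8))
    return cus


if __name__ == "__main__":
    print(count_partitions())  # size of the search space for one CTU
    example = decode_hcpm(
        1, [1, 0, 0, 0], [[1, 0, 0, 0], [0] * 4, [0] * 4, [0] * 4]
    )
    print(len(example), "CUs")
```

Under these assumptions, one CTU has 83,522 possible partitions. A predictor that outputs the 21 flags directly lets the encoder skip the recursive RDO checks and decode the partition in a single pass, which is the intuition behind predicting the whole map at once.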

Code & Models

Repositories

HEVC-Projects/CPH
Official

Taxonomy

Methods: Sigmoid Activation · Tanh Activation · Long Short-Term Memory