DeepQTMT: A Deep Learning Approach for Fast QTMT-based CU Partition of   Intra-mode VVC

Tianyi Li; Mai Xu; Runzhi Tang; Ying Chen; Qunliang Xing

arXiv:2006.13125·eess.IV·June 8, 2021

DeepQTMT: A Deep Learning Approach for Fast QTMT-based CU Partition of Intra-mode VVC

Tianyi Li, Mai Xu, Runzhi Tang, Ying Chen, Qunliang Xing

PDF

1 Repo

TL;DR

This paper introduces a deep learning method to predict CU partitioning in VVC, significantly reducing encoding time while maintaining near-optimal compression performance.

Contribution

It proposes a multi-stage exit CNN with an adaptive loss function for fast CU partition prediction in intra-mode VVC, reducing complexity effectively.

Findings

01

Encoding time reduced by up to 66.88%

02

BD-BR increase limited to around 3.2%

03

Outperforms existing state-of-the-art methods

Abstract

Versatile Video Coding (VVC), as the latest standard, significantly improves the coding efficiency over its ancestor standard High Efficiency Video Coding (HEVC), but at the expense of sharply increased complexity. In VVC, the quad-tree plus multi-type tree (QTMT) structure of coding unit (CU) partition accounts for over 97% of the encoding time, due to the brute-force search for recursive rate-distortion (RD) optimization. Instead of the brute-force QTMT search, this paper proposes a deep learning approach to predict the QTMT-based CU partition, for drastically accelerating the encoding process of intra-mode VVC. First, we establish a large-scale database containing sufficient CU partition patterns with diverse video content, which can facilitate the data-driven VVC complexity reduction. Next, we propose a multi-stage exit CNN (MSE-CNN) model with an early-exit mechanism to determine…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tianyili2017/CPIV
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAdaptive Robust Loss