Machine Learning based Efficient QT-MTT Partitioning Scheme for VVC Intra Encoders
Alexandre Tissier, Wassim Hamidouche, Souhaiel Belhadj Dit Mdalsi,, Jarno Vanne, Franck Galpin, Daniel Menard

TL;DR
This paper introduces a machine learning-based method combining CNN and decision trees to efficiently predict VVC intra encoder partitioning, significantly reducing computational complexity with minimal impact on bitrate.
Contribution
It proposes a two-stage learning approach using CNN and decision trees for MTT partitioning prediction in VVC intra encoders, reducing complexity effectively.
Findings
46.6% complexity reduction with 0.86% bitrate increase
69.8% complexity reduction with 2.57% bitrate loss
outperforms state-of-the-art solutions in complexity-efficiency trade-off
Abstract
The next-generation Versatile Video Coding (VVC) standard introduces a new Multi-Type Tree (MTT) block partitioning structure that supports Binary-Tree (BT) and Ternary-Tree (TT) splits in both vertical and horizontal directions. This new approach leads to five possible splits at each block depth and thereby improves the coding efficiency of VVC over that of the preceding High Efficiency Video Coding (HEVC) standard, which only supports Quad-Tree (QT) partitioning with a single split per block depth. However, MTT also has brought a considerable impact on encoder computational complexity. In this paper, a two-stage learning-based technique is proposed to tackle the complexity overhead of MTT in VVC intra encoders. In our scheme, the input block is first processed by a Convolutional Neural Network (CNN) to predict its spatial features through a vector of probabilities describing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Advanced Vision and Imaging · Image and Video Quality Assessment
