Encoder-Decoder-Based Intra-Frame Block Partitioning Decision

Yucheng Jiang; Han Peng; Yan Song; Jie Yu; Peng Zhang; Songping Mai

arXiv:2310.06412·cs.MM·October 11, 2023

Encoder-Decoder-Based Intra-Frame Block Partitioning Decision

Yucheng Jiang, Han Peng, Yan Song, Jie Yu, Peng Zhang, Songping Mai

PDF

Open Access

TL;DR

This paper introduces a neural network-based method using CNN and Transformer to accelerate intra-frame block partitioning in video encoding, achieving significant time reduction with minimal performance loss.

Contribution

It presents a novel encoder-decoder neural network architecture that fully parallelizes intra-mode decision, significantly speeding up the process in video coding.

Findings

01

87.84% reduction in encoding time

02

8.09% decrease in coding performance

03

Effective acceleration with minimal quality loss

Abstract

The recursive intra-frame block partitioning decision process, a crucial component of the next-generation video coding standards, exerts significant influence over the encoding time. In this paper, we propose an encoder-decoder neural network (NN) to accelerate this process. Specifically, a CNN is utilized to compress the pixel data of the largest coding unit (LCU) into a fixed-length vector. Subsequently, a Transformer decoder is employed to transcribe the fixed-length vector into a variable-length vector, which represents the block partitioning outcomes of the encoding LCU. The vector transcription process adheres to the constraints imposed by the block partitioning algorithm. By fully parallelizing the NN prediction in the intra-mode decision, substantial time savings can be attained during the decision phase. The experimental results obtained from high-definition (HD) sequences…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Coding and Compression Technologies · Advanced Vision and Imaging · Advanced Image Processing Techniques