BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding

Farhad Pakdaman; Mohammad Ali Adelimanesh; Mahmoud Reza Hashemi

arXiv:2201.07823·cs.MM·December 13, 2022

BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding

Farhad Pakdaman, Mohammad Ali Adelimanesh, Mahmoud Reza Hashemi

PDF

TL;DR

This paper introduces a lightweight bimodal machine learning approach for VVC intra coding that significantly reduces encoding time with minimal efficiency loss, suitable for low-power and real-time applications.

Contribution

It proposes a novel bimodal learning framework with feature reduction and training strategies to simplify intra mode decision in VVC, reducing complexity and overhead.

Findings

01

Up to 24% reduction in encoding time

02

Negligible 0.2% increase in computational overhead

03

Effective bimodal learning boosts performance

Abstract

The latest video coding standard, Versatile Video Coding (VVC), achieves almost twice coding efficiency compared to its predecessor, the High Efficiency Video Coding (HEVC). However, achieving this efficiency (for intra coding) requires 31x computational complexity compared to HEVC, making it challenging for low power and real-time applications. This paper, proposes a novel machine learning approach that jointly and separately employs two modalities of features, to simplify the intra coding decision. First a set of features are extracted that use the existing DCT core of VVC, to assess the texture characteristics, and forms the first modality of data. This produces high quality features with almost no overhead. The distribution of intra modes at the neighboring blocks is also used to form the second modality of data, which provides statistical information about the frame. Second, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.