BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding
Farhad Pakdaman, Mohammad Ali Adelimanesh, Mahmoud Reza Hashemi

TL;DR
This paper introduces a lightweight bimodal machine learning approach for VVC intra coding that significantly reduces encoding time with minimal efficiency loss, suitable for low-power and real-time applications.
Contribution
It proposes a novel bimodal learning framework with feature reduction and training strategies to simplify intra mode decision in VVC, reducing complexity and overhead.
Findings
Up to 24% reduction in encoding time
Negligible 0.2% increase in computational overhead
Effective bimodal learning boosts performance
Abstract
The latest video coding standard, Versatile Video Coding (VVC), achieves almost twice coding efficiency compared to its predecessor, the High Efficiency Video Coding (HEVC). However, achieving this efficiency (for intra coding) requires 31x computational complexity compared to HEVC, making it challenging for low power and real-time applications. This paper, proposes a novel machine learning approach that jointly and separately employs two modalities of features, to simplify the intra coding decision. First a set of features are extracted that use the existing DCT core of VVC, to assess the texture characteristics, and forms the first modality of data. This produces high quality features with almost no overhead. The distribution of intra modes at the neighboring blocks is also used to form the second modality of data, which provides statistical information about the frame. Second, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
