Explainable Machine Learning based Transform Coding for High Efficiency Intra Prediction
Na Li, Yun Zhang, C.-C. Jay Kuo

TL;DR
This paper introduces an explainable machine learning based Saab transform for intra video coding, optimizing energy compaction and decorrelation to improve coding efficiency over traditional DCT methods.
Contribution
It proposes a novel Saab transform framework with offline learning and intra mode dependency, enhancing intra coding performance with theoretical and experimental validation.
Findings
Saab transform achieves up to 10% bitrate reduction.
Improves energy compaction and decorrelation compared to DCT.
Enhances intra video coding efficiency with multiple integration strategies.
Abstract
Machine learning techniques provide a chance to explore the coding performance potential of transform. In this work, we propose an explainable transform based intra video coding to improve the coding efficiency. Firstly, we model machine learning based transform design as an optimization problem of maximizing the energy compaction or decorrelation capability. The explainable machine learning based transform, i.e., Subspace Approximation with Adjusted Bias (Saab) transform, is analyzed and compared with the mainstream Discrete Cosine Transform (DCT) on their energy compaction and decorrelation capabilities. Secondly, we propose a Saab transform based intra video coding framework with off-line Saab transform learning. Meanwhile, intra mode dependent Saab transform is developed. Then, Rate Distortion (RD) gain of Saab transform based intra video coding is theoretically and experimentally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Image and Signal Denoising Methods · Digital Filter Design and Implementation
