Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding
Marc G\'orriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta, Mrak

TL;DR
This paper introduces simplified, cost-effective attention-based neural network architectures for chroma intra-prediction in video coding, achieving significant complexity reduction while maintaining state-of-the-art compression performance.
Contribution
It proposes a novel size-agnostic multi-model approach and various simplifications to reduce the complexity of neural network-based chroma intra-prediction.
Findings
Achieves around 90% reduction in model parameters.
Maintains compression efficiency comparable to state-of-the-art methods.
Offers multiple simplification strategies for hardware-aware implementation.
Abstract
Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to be complex and computationally intense, and may be difficult to deploy in a practical video coding pipeline. This work focuses on reducing the complexity of such methodologies, to design a set of simplified and cost-effective attention-based architectures for chroma intra-prediction. A novel size-agnostic multi-model approach is proposed to reduce the complexity of the inference process. The resulting simplified architecture is still capable of outperforming state-of-the-art methods. Moreover, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
