Neural Video Compression using 2D Gaussian Splatting
Lakshya Gupta, Imran N. Junejo

TL;DR
This paper introduces a real-time neural video compression method using 2D Gaussian Splatting, significantly reducing encoding time and enabling efficient, content-aware video streaming suitable for applications like video conferencing.
Contribution
It presents the first Gaussian splatting-based neural video codec with real-time decoding, leveraging a novel initialization and redundancy reduction to improve speed and efficiency.
Findings
88% reduction in encoding time
Requires only thousands of Gaussians for quality
First Gaussian splatting-based neural video codec
Abstract
The computer vision and image processing research community has been involved in standardizing video data communications for the past many decades, leading to standards such as AVC, HEVC, VVC, AV1, AV2, etc. However, recent groundbreaking works have focused on employing deep learning-based techniques to replace the traditional video codec pipeline to a greater affect. Neural video codecs (NVC) create an end-to-end ML-based solution that does not rely on any handcrafted features (motion or edge-based) and have the ability to learn content-aware compression strategies, offering better adaptability and higher compression efficiency than traditional methods. This holds a great potential not only for hardware design, but also for various video streaming platforms and applications, especially video conferencing applications such as MS-Teams or Zoom that have found extensive usage in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
