Utilising Low Complexity CNNs to Lift Non-Local Redundancies in Video Coding
Jan P. Klopp, Liang-Gee Chen, Shao-Yi Chien

TL;DR
This paper introduces low-complexity CNNs trained during encoding to exploit non-local redundancies in video, improving compression efficiency with minimal computational overhead, suitable for resource-constrained environments.
Contribution
It presents a novel low-memory CNN-based method for enhancing video coding by exploiting non-local redundancies, trained on-the-fly and integrated into existing codecs.
Findings
Achieves coding gains comparable to pretrained denoising CNNs
Requires only about 1% of the computational complexity of prior CNN methods
Performs effectively on long video segments and high-resolution frames
Abstract
Digital media is ubiquitous and produced in ever-growing quantities. This necessitates a constant evolution of compression techniques, especially for video, in order to maintain efficient storage and transmission. In this work, we aim at exploiting non-local redundancies in video data that remain difficult to erase for conventional video codecs. We design convolutional neural networks with a particular emphasis on low memory and computational footprint. The parameters of those networks are trained on the fly, at encoding time, to predict the residual signal from the decoded video signal. After the training process has converged, the parameters are compressed and signalled as part of the code of the underlying video codec. The method can be applied to any existing video codec to increase coding gains while its low computational footprint allows for an application under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
