LRConv-NeRV: Low Rank Convolution for Efficient Neural Video Compression
Tamer Shanableh

TL;DR
LRConv-NeRV introduces a low-rank convolutional approach to neural video compression, significantly reducing computational complexity and model size while maintaining high reconstruction quality and temporal stability.
Contribution
It proposes a novel low-rank convolution method for NeRV, enabling controllable efficiency-quality trade-offs and substantial reductions in decoder complexity and size.
Findings
Reduces decoder GFLOPs by 68% when applied to final stage
Achieves approximately 9.2% bitrate reduction with negligible quality loss
Preserves temporal coherence and stability comparable to baseline
Abstract
Neural Representations for Videos (NeRV) encode entire video sequences within neural network parameters, offering an alternative paradigm to conventional video codecs. However, the convolutional decoder of NeRV remains computationally expensive and memory intensive, limiting its deployment in resource-constrained environments. This paper proposes LRConv-NeRV, an efficient NeRV variant that replaces selected dense 3x3 convolutional layers with structured low-rank separable convolutions, trained end-to-end within the decoder architecture. By progressively applying low-rank factorization from the largest to earlier decoder stages, LRConv-NeRV enables controllable trade-offs between reconstruction quality and efficiency. Extensive experiments demonstrate that applying LRConv only to the final decoder stage reduces decoder complexity by 68%, from 201.9 to 64.9 GFLOPs, and model size by 9.3%,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Video Coding and Compression Technologies · Image and Video Quality Assessment
