MNeRV: A Multilayer Neural Representation for Videos
Qingling Chang, Haohui Yu, Shuxuan Fu, Zhiqiang Zeng, Chuangquan, Chen

TL;DR
MNeRV introduces a multilayer neural video representation with a new encoder-decoder architecture, significantly improving reconstruction quality and efficiency, and enhancing performance in downstream video tasks.
Contribution
It proposes a multilayer neural representation with a novel encoder-decoder design to reduce parameter redundancy and improve video regression performance.
Findings
Achieves +4.06 PSNR in reconstruction quality
Uses fewer parameters for better performance
Enhances downstream video restoration and interpolation
Abstract
As a novel video representation method, Neural Representations for Videos (NeRV) has shown great potential in the fields of video compression, video restoration, and video interpolation. In the process of representing videos using NeRV, each frame corresponds to an embedding, which is then reconstructed into a video frame sequence after passing through a small number of decoding layers (E-NeRV, HNeRV, etc.). However, this small number of decoding layers can easily lead to the problem of redundant model parameters due to the large proportion of parameters in a single decoding layer, which greatly restricts the video regression ability of neural network models. In this paper, we propose a multilayer neural representation for videos (MNeRV) and design a new decoder M-Decoder and its matching encoder M-Encoder. MNeRV has more encoding and decoding layers, which effectively alleviates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Anomaly Detection Techniques and Applications
