Adapting MIMO video restoration networks to low latency constraints
Val\'ery Dewil, Zhe Zheng, Arnaud Barral, Lara Raad, Nao Nicolas,, Ioannis Cassagne, Jean-michel Morel, Gabriele Facciolo, Bruno Galerne, Pablo, Arias

TL;DR
This paper enhances MIMO neural networks for low-latency video restoration by addressing performance drops and artifacts through recurrence and overlapping techniques, achieving state-of-the-art results and introducing a new drone footage benchmark.
Contribution
It introduces simple, adaptable solutions to improve low-latency MIMO video restoration networks by increasing temporal receptive field and smoothing transitions, with new state-of-the-art performance.
Findings
Improved reconstruction error in low-latency settings
Enhanced temporal consistency in video outputs
Effective solutions applicable to various architectures
Abstract
MIMO (multiple input, multiple output) approaches are a recent trend in neural network architectures for video restoration problems, where each network evaluation produces multiple output frames. The video is split into non-overlapping stacks of frames that are processed independently, resulting in a very appealing trade-off between output quality and computational cost. In this work we focus on the low-latency setting by limiting the number of available future frames. We find that MIMO architectures suffer from problems that have received little attention so far, namely (1) the performance drops significantly due to the reduced temporal receptive field, particularly for frames at the borders of the stack, (2) there are strong temporal discontinuities at stack transitions which induce a step-wise motion artifact. We propose two simple solutions to alleviate these problems: recurrence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Video Coding and Compression Technologies · Telecommunications and Broadcasting Technologies
MethodsSoftmax · Attention Is All You Need · Focus
