TL;DR
This paper introduces a novel video super-resolution method that directly utilizes raw camera sensor data, employing a Hidden Markov Model-based inference framework and specialized neural modules, leading to superior results over existing methods.
Contribution
The paper presents a new VSR approach that exploits raw camera data and a Hidden Markov Model inference-based architecture, along with a dedicated raw video dataset for training and evaluation.
Findings
Outperforms state-of-the-art VSR methods on raw data
Effectively separates super-resolution and color correction processes
Demonstrates adaptability to different camera-ISP pipelines
Abstract
To the best of our knowledge, the existing deep-learning-based Video Super-Resolution (VSR) methods exclusively make use of videos produced by the Image Signal Processor (ISP) of the camera system as inputs. Such methods are 1) inherently suboptimal due to information loss incurred by non-invertible operations in ISP, and 2) inconsistent with the real imaging pipeline where VSR in fact serves as a pre-processing unit of ISP. To address this issue, we propose a new VSR method that can directly exploit camera sensor data, accompanied by a carefully built Raw Video Dataset (RawVD) for training, validation, and testing. This method consists of a Successive Deep Inference (SDI) module and a reconstruction module, among others. The SDI module is designed according to the architectural principle suggested by a canonical decomposition result for Hidden Markov Model (HMM) inference; it estimates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
