TL;DR
This paper introduces a hybrid, learnable, and interpretable image signal processor (ISP) model that uses dictionaries for bidirectional RAW-RGB mapping, enabling high-quality image reconstruction and denoising with limited data.
Contribution
The authors propose a novel invertible, dictionary-based ISP model that combines interpretability with end-to-end learnability, addressing data scarcity and control issues in traditional methods.
Findings
Achieves state-of-the-art results in RAW image reconstruction.
Performs effectively in RAW denoising tasks with limited data.
Enables data augmentation through learned dictionaries.
Abstract
Digital cameras transform sensor RAW readings into RGB images by means of their Image Signal Processor (ISP). Computational photography tasks such as image denoising and colour constancy are commonly performed in the RAW domain, in part due to the inherent hardware design, but also due to the appealing simplicity of noise statistics that result from the direct sensor readings. Despite this, the availability of RAW images is limited in comparison with the abundance and diversity of available RGB data. Recent approaches have attempted to bridge this gap by estimating the RGB to RAW mapping: handcrafted model-based methods that are interpretable and controllable usually require manual parameter fine-tuning, while end-to-end learnable neural networks require large amounts of training data, at times with complex training procedures, and generally lack interpretability and parametric control.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
MethodsSelective Kernel Convolution · Convolution
