gQIR: Generative Quanta Image Reconstruction
Aryan Garg, Sizhuo Ma, Mohit Gupta

TL;DR
This paper introduces gQIR, a novel method that adapts large diffusion models to reconstruct high-quality images from sparse, noisy photon detection data in low-light conditions, outperforming existing approaches.
Contribution
The paper presents a new approach that integrates latent diffusion models with photon statistics handling for quanta burst imaging, enabling high-quality reconstructions in photon-limited scenarios.
Findings
Significant perceptual quality improvements over classical methods.
Effective handling of Bernoulli photon noise in image reconstruction.
Successful application to real-world color SPAD datasets and challenging video benchmarks.
Abstract
Capturing high-quality images from only a few detected photons is a fundamental challenge in computational imaging. Single-photon avalanche diode (SPAD) sensors promise high-quality imaging in regimes where conventional cameras fail, but raw \emph{quanta frames} contain only sparse, noisy, binary photon detections. Recovering a coherent image from a burst of such frames requires handling alignment, denoising, and demosaicing (for color) under noise statistics far outside those assumed by standard restoration pipelines or modern generative models. We present an approach that adapts large text-to-image latent diffusion models to the photon-limited domain of quanta burst imaging. Our method leverages the structural and semantic priors of internet-scale diffusion models while introducing mechanisms to handle Bernoulli photon statistics. By integrating latent-space restoration with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Domain Adaptation and Few-Shot Learning
