TL;DR
This paper introduces a deep learning method that fuses multiple features from SPAD camera data to enhance the resolution and denoise depth images without additional sensors, benefiting real-time 3D imaging applications.
Contribution
A novel multi-feature fusion deep network designed specifically for SPAD camera data to improve depth image resolution and denoising without extra sensors.
Findings
Achieves four-fold depth resolution enhancement.
Provides effective denoising across various noise levels.
Demonstrates significant improvement in 3D imaging quality.
Abstract
Three-dimensional imaging plays an important role in imaging applications where it is necessary to record depth. The number of applications that use depth imaging is increasing rapidly, and examples include self-driving autonomous vehicles and auto-focus assist on smartphone cameras. Light detection and ranging (LIDAR) via single-photon sensitive detector (SPAD) arrays is an emerging technology that enables the acquisition of depth images at high frame rates. However, the spatial resolution of this technology is typically low in comparison to the intensity images recorded by conventional cameras. To increase the native resolution of depth images from a SPAD camera, we develop a deep network built specifically to take advantage of the multiple features that can be extracted from a camera's histogram data. The network is designed for a SPAD camera operating in a dual-mode such that it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
