Generalizable Non-Line-of-Sight Imaging with Learnable Physical Priors

Shida Sun; Yue Li; Yueyi Zhang; Zhiwei Xiong

arXiv:2409.14011·cs.CV·September 24, 2024

Generalizable Non-Line-of-Sight Imaging with Learnable Physical Priors

Shida Sun, Yue Li, Yueyi Zhang, Zhiwei Xiong

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a learning-based NLOS imaging method with learnable physical priors that generalizes well across real-world datasets, even with low SNR, by adaptively compensating paths and spectrum selection.

Contribution

It proposes a novel approach with Learnable Path Compensation and Adaptive Phasor Field, enabling better generalization and robustness in NLOS imaging compared to prior fixed-prior methods.

Findings

01

Effective in low SNR conditions

02

Generalizes across different real-world datasets

03

Trained solely on synthetic data

Abstract

Non-line-of-sight (NLOS) imaging, recovering the hidden volume from indirect reflections, has attracted increasing attention due to its potential applications. Despite promising results, existing NLOS reconstruction approaches are constrained by the reliance on empirical physical priors, e.g., single fixed path compensation. Moreover, these approaches still possess limited generalization ability, particularly when dealing with scenes at a low signal-to-noise ratio (SNR). To overcome the above problems, we introduce a novel learning-based solution, comprising two key designs: Learnable Path Compensation (LPC) and Adaptive Phasor Field (APF). The LPC applies tailored path compensation coefficients to adapt to different objects in the scene, effectively reducing light wave attenuation, especially in distant regions. Meanwhile, the APF learns the precise Gaussian window of the illumination…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 6Confidence 4

Strengths

Typically training on synthetic data does not generalize to real data due to synthetic and real gap. However, the proposed method learns to compensate for radiometric falloff and suppress background noise using synthetic training data and is able to generalize to real-world data.

Weaknesses

The paper lacks discussion on the choice of G_z for initial compensation. It’s also unclear how much the 3D CNN block is contributing compared to the initially compensated features. The paper shows results on a custom dataset with the “composite” scene consisting of multiple surface materials. However, there is no quantitative evaluation on how different surface materials at different distances are compensated by LPC.

Reviewer 02Rating 6Confidence 4

Strengths

The paper is well motivated and distinguished clearly from previous works, as the method is structured around leveraging flexibility in the virtual phasor field formulation that was not used by prior methods to improve NLOS reconstruction performance. The improvement in reconstruction performance is clearly demonstrated with extensive experiments on existing and newly gathered datasets across a large number of NLOS reconstruction methods.

Weaknesses

Though the method shows strong performance compared to baselines, I'm not sure the method is better for the reasons that the authors say, rather than just because the method has more parameters and is trained on more data than previous methods. In particular, the authors make several claims about why their method works better that I am not sure are precisely substantiated in the paper. First, the authors claim that the using adaptive coefficients for the radiometric intensity falloff is better

Reviewer 03Rating 5Confidence 4

Strengths

The originality of this paper arises from the improvement of an existing NLOS reconstruction network LFE (Chen et al., 2020). The proposed method can achieve global intensity consistency and avoid manual adjustment of empirical parameters.

Weaknesses

IMO, the physical explanation of this paper is inaccurate and misleading, especially the explanation of the learning of the Gaussian window. And the improvement of the results is limited quantitatively and qualitatively, especially compared with NLOST.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optical Sensing Technologies · Photoacoustic and Ultrasonic Imaging · Optical Imaging and Spectroscopy Techniques

MethodsSoftmax · Attention Is All You Need