TL;DR
This paper presents a novel joint translation-stereo learning approach for nighttime depth estimation that does not require ground-truth disparities for night images, effectively handling challenging lighting effects and uninformative regions.
Contribution
The proposed method introduces a joint training framework combining day/night image translation and stereo depth estimation without needing paired night disparity ground-truths.
Findings
Outperforms baseline methods on nighttime stereo depth estimation
Effectively handles glow, glare, and low-light regions
Maintains structure and smoothness in depth predictions
Abstract
Nighttime stereo depth estimation is still challenging, as assumptions associated with daytime lighting conditions do not hold any longer. Nighttime is not only about low-light and dense noise, but also about glow/glare, flares, non-uniform distribution of light, etc. One of the possible solutions is to train a network on night stereo images in a fully supervised manner. However, to obtain proper disparity ground-truths that are dense, independent from glare/glow, and have sufficiently far depth ranges is extremely intractable. To address the problem, we introduce a network joining day/night translation and stereo. In training the network, our method does not require ground-truth disparities of the night images, or paired day/night images. We utilize a translation network that can render realistic night stereo images from day stereo images. We then train a stereo network on the rendered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
