360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network
Qi Feng, Hubert P. H. Shum, Shigeo Morishima

TL;DR
This paper introduces Depth360, a large-scale dataset for 360-degree depth estimation, and SegFuse, a novel multi-task network that improves depth prediction accuracy from omnidirectional images by leveraging geometric and temporal constraints.
Contribution
The work presents a new dataset and a multi-branch neural network architecture that enhances 360-degree depth estimation by integrating geometric, temporal, and semantic information.
Findings
Depth360 dataset enables better training for omnidirectional depth estimation.
SegFuse outperforms existing methods on benchmark tests.
The approach produces more consistent and detailed depth maps.
Abstract
Single-view depth estimation from omnidirectional images has gained popularity with its wide range of applications such as autonomous driving and scene reconstruction. Although data-driven learning-based methods demonstrate significant potential in this field, scarce training data and ineffective 360 estimation algorithms are still two key limitations hindering accurate estimation across diverse domains. In this work, we first establish a large-scale dataset with varied settings called Depth360 to tackle the training data problem. This is achieved by exploring the use of a plenteous source of data, 360 videos from the internet, using a test-time training method that leverages unique information in each omnidirectional sequence. With novel geometric and temporal constraints, our method generates consistent and convincing depth samples to facilitate single-view estimation. We then propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Robotics and Sensor-Based Localization
