360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse   Network

Qi Feng; Hubert P. H. Shum; Shigeo Morishima

arXiv:2202.08010·cs.CV·February 17, 2022

360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network

Qi Feng, Hubert P. H. Shum, Shigeo Morishima

PDF

Open Access 1 Repo

TL;DR

This paper introduces Depth360, a large-scale dataset for 360-degree depth estimation, and SegFuse, a novel multi-task network that improves depth prediction accuracy from omnidirectional images by leveraging geometric and temporal constraints.

Contribution

The work presents a new dataset and a multi-branch neural network architecture that enhances 360-degree depth estimation by integrating geometric, temporal, and semantic information.

Findings

01

Depth360 dataset enables better training for omnidirectional depth estimation.

02

SegFuse outperforms existing methods on benchmark tests.

03

The approach produces more consistent and detailed depth maps.

Abstract

Single-view depth estimation from omnidirectional images has gained popularity with its wide range of applications such as autonomous driving and scene reconstruction. Although data-driven learning-based methods demonstrate significant potential in this field, scarce training data and ineffective 360 estimation algorithms are still two key limitations hindering accurate estimation across diverse domains. In this work, we first establish a large-scale dataset with varied settings called Depth360 to tackle the training data problem. This is achieved by exploring the use of a plenteous source of data, 360 videos from the internet, using a test-time training method that leverages unique information in each omnidirectional sequence. With novel geometric and temporal constraints, our method generates consistent and convincing depth samples to facilitate single-view estimation. We then propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hal-lucination/segfuse
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Robotics and Sensor-Based Localization