InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type   Performance in Indoor Monocular Depth

Cho-Ying Wu; Quankai Gao; Chin-Cheng Hsu; Te-Lin Wu; Jing-Wen Chen,; Ulrich Neumann

arXiv:2408.13708·cs.CV·August 27, 2024

InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth

Cho-Ying Wu, Quankai Gao, Chin-Cheng Hsu, Te-Lin Wu, Jing-Wen Chen,, Ulrich Neumann

PDF

Open Access 1 Repo

TL;DR

This paper introduces InSpaceType, a new RGBD dataset and benchmark to evaluate and analyze the robustness and generalization of monocular depth estimation models across diverse indoor space types, revealing performance biases and guiding better data curation.

Contribution

It presents the InSpaceType dataset and benchmark, providing a detailed analysis of model performance variances across indoor space types and offering insights for improving robustness.

Findings

01

Most models show performance imbalance between common and rare space types.

02

Top methods exhibit even more severe performance disparities.

03

Synthetic data curation influences model generalization.

Abstract

Indoor monocular depth estimation helps home automation, including robot navigation or AR/VR for surrounding perception. Most previous methods primarily experiment with the NYUv2 Dataset and concentrate on the overall performance in their evaluation. However, their robustness and generalization to diversely unseen types or categories for indoor spaces (spaces types) have yet to be discovered. Researchers may empirically find degraded performance in a released pretrained model on custom data or less-frequent types. This paper studies the common but easily overlooked factor-space type and realizes a model's performance variances across spaces. We present InSpaceType Dataset, a high-quality RGBD dataset for general indoor scenes, and benchmark 13 recent state-of-the-art methods on InSpaceType. Our examination shows that most of them suffer from performance imbalance between head and tailed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DepthComputation/InSpaceType_Benchmark
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · Industrial Vision Systems and Defect Detection