InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth
Cho-Ying Wu, Quankai Gao, Chin-Cheng Hsu, Te-Lin Wu, Jing-Wen Chen,, Ulrich Neumann

TL;DR
This paper introduces InSpaceType, a new RGBD dataset and benchmark to evaluate and analyze the robustness and generalization of monocular depth estimation models across diverse indoor space types, revealing performance biases and guiding better data curation.
Contribution
It presents the InSpaceType dataset and benchmark, providing a detailed analysis of model performance variances across indoor space types and offering insights for improving robustness.
Findings
Most models show performance imbalance between common and rare space types.
Top methods exhibit even more severe performance disparities.
Synthetic data curation influences model generalization.
Abstract
Indoor monocular depth estimation helps home automation, including robot navigation or AR/VR for surrounding perception. Most previous methods primarily experiment with the NYUv2 Dataset and concentrate on the overall performance in their evaluation. However, their robustness and generalization to diversely unseen types or categories for indoor spaces (spaces types) have yet to be discovered. Researchers may empirically find degraded performance in a released pretrained model on custom data or less-frequent types. This paper studies the common but easily overlooked factor-space type and realizes a model's performance variances across spaces. We present InSpaceType Dataset, a high-quality RGBD dataset for general indoor scenes, and benchmark 13 recent state-of-the-art methods on InSpaceType. Our examination shows that most of them suffer from performance imbalance between head and tailed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Industrial Vision Systems and Defect Detection
