GOOD: Exploring Geometric Cues for Detecting Objects in an Open World
Haiwen Huang, Andreas Geiger, Dan Zhang

TL;DR
This paper introduces GOOD, a geometric cue-based approach for open-world object detection that leverages depth and normals to improve detection of novel objects, outperforming RGB-only models.
Contribution
The paper proposes a novel method incorporating geometric cues into object detection, enhancing detection of unseen objects in an open-world setting.
Findings
GOOD surpasses SOTA by 5.0% AR@100 with only one training class.
Geometric cues improve detection recall for novel categories.
The approach performs well with limited training data.
Abstract
We address the task of open-world class-agnostic object detection, i.e., detecting every object in an image by learning from a limited number of base object classes. State-of-the-art RGB-based models suffer from overfitting the training classes and often fail at detecting novel-looking objects. This is because RGB-based models primarily rely on appearance similarity to detect novel objects and are also prone to overfitting short-cut cues such as textures and discriminative parts. To address these shortcomings of RGB-based object detectors, we propose incorporating geometric cues such as depth and normals, predicted by general-purpose monocular estimators. Specifically, we use the geometric cues to train an object proposal network for pseudo-labeling unannotated novel objects in the training set. Our resulting Geometry-guided Open-world Object Detector (GOOD) significantly improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
Methodsfail · Balanced Selection
