GOOD: Exploring Geometric Cues for Detecting Objects in an Open World

Haiwen Huang; Andreas Geiger; Dan Zhang

arXiv:2212.11720·cs.CV·February 6, 2023·5 cites

GOOD: Exploring Geometric Cues for Detecting Objects in an Open World

Haiwen Huang, Andreas Geiger, Dan Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces GOOD, a geometric cue-based approach for open-world object detection that leverages depth and normals to improve detection of novel objects, outperforming RGB-only models.

Contribution

The paper proposes a novel method incorporating geometric cues into object detection, enhancing detection of unseen objects in an open-world setting.

Findings

01

GOOD surpasses SOTA by 5.0% AR@100 with only one training class.

02

Geometric cues improve detection recall for novel categories.

03

The approach performs well with limited training data.

Abstract

We address the task of open-world class-agnostic object detection, i.e., detecting every object in an image by learning from a limited number of base object classes. State-of-the-art RGB-based models suffer from overfitting the training classes and often fail at detecting novel-looking objects. This is because RGB-based models primarily rely on appearance similarity to detect novel objects and are also prone to overfitting short-cut cues such as textures and discriminative parts. To address these shortcomings of RGB-based object detectors, we propose incorporating geometric cues such as depth and normals, predicted by general-purpose monocular estimators. Specifically, we use the geometric cues to train an object proposal network for pseudo-labeling unannotated novel objects in the training set. Our resulting Geometry-guided Open-world Object Detector (GOOD) significantly improves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

autonomousvision/good
pytorchOfficial

Videos

GOOD: Exploring geometric cues for detecting objects in an open world· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

Methodsfail · Balanced Selection