DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
Jia Syuen Lim, Zhuoxiao Chen, Mahsa Baktashmotlagh, Zhi Chen, Xin Yu,, Zi Huang, Yadan Luo

TL;DR
DiPEx introduces a self-supervised prompt expansion method that enhances class-agnostic object detection by diversifying prompts, significantly improving recall and out-of-distribution detection performance using vision-language models.
Contribution
The paper proposes a novel dispersing prompt expansion (DiPEx) approach that learns to generate diverse, non-overlapping prompts to improve object detection recall and out-of-distribution detection.
Findings
Achieves up to 20.1% improvement in AR over existing methods.
Improves AP by 21.3% on MS-COCO compared to SAM.
Demonstrates effectiveness in out-of-distribution object detection.
Abstract
Class-agnostic object detection (OD) can be a cornerstone or a bottleneck for many downstream vision tasks. Despite considerable advancements in bottom-up and multi-object discovery methods that leverage basic visual cues to identify salient objects, consistently achieving a high recall rate remains difficult due to the diversity of object types and their contextual complexity. In this work, we investigate using vision-language models (VLMs) to enhance object detection via a self-supervised prompt learning strategy. Our initial findings indicate that manually crafted text queries often result in undetected objects, primarily because detection confidence diminishes when the query words exhibit semantic overlap. To address this, we propose a Dispersing Prompt Expansion (DiPEx) approach. DiPEx progressively learns to expand a set of distinct, non-overlapping hyperspherical prompts to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · COVID-19 diagnosis using AI
MethodsSparse Evolutionary Training · Segment Anything Model
