A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu and Long Chen

TL;DR
This survey reviews recent advances in open-vocabulary detection and segmentation, highlighting methodologies, challenges, and future directions for models that recognize objects beyond pre-defined categories.
Contribution
It provides a comprehensive taxonomy and analysis of recent open-vocabulary detection and segmentation methods, covering various supervision signals and tasks.
Findings
Weak supervision signals improve model generalization
Different methodologies are categorized by supervision type
Benchmarking results highlight strengths and weaknesses
Abstract
As the most fundamental scene understanding tasks, object detection and segmentation have made tremendous progress in the deep learning era. Due to the expensive manual labeling cost, the annotated categories in existing datasets are often small-scale and pre-defined; as a result, state-of-the-art fully-supervised detectors and segmentors fail to generalize beyond the closed vocabulary. To resolve this limitation, in the last few years, the community has paid increasing attention to Open-Vocabulary Detection (OVD) and Segmentation (OVS). By "open-vocabulary", we mean that the models can classify objects beyond pre-defined categories. In this survey, we provide a comprehensive review of recent developments in OVD and OVS. A taxonomy is first developed to organize different tasks and methodologies. We find that the permission and usage of weak supervision signals can well discriminate…
Taxonomy
Topics: Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
