An Attribute-Enriched Dataset and Auto-Annotated Pipeline for Open Detection
Pengfei Qi, Yifei Zhang, Wenqiang Li, Youwen Hu, Kunlong Bai

TL;DR
This paper introduces the Objects365-Attr dataset with extensive attribute annotations to improve object detection, especially for complex or uncommon objects, and evaluates its impact using YOLO-World models.
Contribution
The creation of a large-scale attribute-enriched dataset, Objects365-Attr, and an evaluation of its effectiveness in enhancing object detection performance.
Findings
The dataset contains 5.6 million attribute descriptions across 1.4 million bounding boxes.
Attribute annotations help reduce detection inconsistencies for complex objects.
Evaluation shows improved detection performance with the new dataset.
Abstract
Detecting objects of interest through language often presents challenges, particularly with objects that are uncommon or complex to describe, due to perceptual discrepancies between automated models and human annotators. These challenges highlight the need for comprehensive datasets that go beyond standard object labels by incorporating detailed attribute descriptions. To address this need, we introduce the Objects365-Attr dataset, an extension of the existing Objects365 dataset, distinguished by its attribute annotations. This dataset reduces inconsistencies in object detection by integrating a broad spectrum of attributes, including color, material, state, texture and tone. It contains an extensive collection of 5.6M object-level attribute descriptions, meticulously annotated across 1.4M bounding boxes. Additionally, to validate the dataset's effectiveness, we conduct a rigorous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification
