TL;DR
DART is an automated pipeline that enhances object detection accuracy and efficiency by automating data diversification, annotation, review, and training, reducing manual effort and adapting to new environments.
Contribution
This paper introduces DART, a novel end-to-end automated object detection pipeline that integrates data generation, open-vocabulary annotation, pseudo-label review, and model training.
Findings
Significantly increased average precision from 0.064 to 0.832.
Automated pipeline reduces manual labeling effort.
Modular design allows easy upgrades and adaptation.
Abstract
Accurate real-time object detection is vital across numerous industrial applications, from safety monitoring to quality control. Traditional approaches, however, are hindered by arduous manual annotation and data collection, struggling to adapt to ever-changing environments and novel target objects. To address these limitations, this paper presents DART, an innovative automated end-to-end pipeline that revolutionizes object detection workflows from data collection to model evaluation. It eliminates the need for laborious human labeling and extensive data collection while achieving outstanding accuracy across diverse scenarios. DART encompasses four key stages: (1) Data Diversification using subject-driven image generation (DreamBooth with SDXL), (2) Annotation via open-vocabulary object detection (Grounding DINO) to generate bounding box and class labels, (3) Review of generated images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDifficulty-Aware Rejection Tuning
