DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training

Chen Xin; Andreas Hartel; Enkelejda Kasneci

arXiv:2407.09174·cs.CV·June 24, 2025

DART: An Automated End-to-End Object Detection Pipeline with Data Diversification, Open-Vocabulary Bounding Box Annotation, Pseudo-Label Review, and Model Training

Chen Xin, Andreas Hartel, Enkelejda Kasneci

PDF

1 Repo

TL;DR

DART is an automated pipeline that enhances object detection accuracy and efficiency by automating data diversification, annotation, review, and training, reducing manual effort and adapting to new environments.

Contribution

This paper introduces DART, a novel end-to-end automated object detection pipeline that integrates data generation, open-vocabulary annotation, pseudo-label review, and model training.

Findings

01

Significantly increased average precision from 0.064 to 0.832.

02

Automated pipeline reduces manual labeling effort.

03

Modular design allows easy upgrades and adaptation.

Abstract

Accurate real-time object detection is vital across numerous industrial applications, from safety monitoring to quality control. Traditional approaches, however, are hindered by arduous manual annotation and data collection, struggling to adapt to ever-changing environments and novel target objects. To address these limitations, this paper presents DART, an innovative automated end-to-end pipeline that revolutionizes object detection workflows from data collection to model evaluation. It eliminates the need for laborious human labeling and extensive data collection while achieving outstanding accuracy across diverse scenarios. DART encompasses four key stages: (1) Data Diversification using subject-driven image generation (DreamBooth with SDXL), (2) Annotation via open-vocabulary object detection (Grounding DINO) to generate bounding box and class labels, (3) Review of generated images…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chen-xin-94/dart
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDifficulty-Aware Rejection Tuning