ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
Jingyuan Zhu, Shiyu Li, Yuxuan Liu, Ping Huang, Jiulong Shan, Huimin, Ma, Jian Yuan

TL;DR
ODGEN is a diffusion model-based data generation method that creates high-quality, controllable images with multi-class and occluded objects, significantly improving object detection performance across various domains.
Contribution
The paper introduces ODGEN, a novel diffusion-based approach for domain-specific image synthesis conditioned on bounding boxes, enhancing data quality and controllability for object detection.
Findings
Improves [email protected]:.95 by up to 25.3% on object detectors.
Outperforms prior controllable generative methods.
Demonstrates robustness in complex scenes and specific domains.
Abstract
Modern diffusion-based image generative models have made significant progress and become promising to enrich training data for the object detection task. However, the generation quality and the controllability for complex scenes containing multi-class objects and dense objects with occlusions remain limited. This paper presents ODGEN, a novel method to generate high-quality images conditioned on bounding boxes, thereby facilitating data synthesis for object detection. Given a domain-specific object detection dataset, we first fine-tune a pre-trained diffusion model on both cropped foreground objects and entire images to fit target distributions. Then we propose to control the diffusion model using synthesized visual prompts with spatial constraints and object-wise textual descriptions. ODGEN exhibits robustness in handling complex scenes and specific domains. Further, we design a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
MethodsDiffusion
