A Simple Background Augmentation Method for Object Detection with Diffusion Model
Yuhang Li, Xin Dong, Chen Chen, Weiming Zhuang, Lingjuan Lyu

TL;DR
This paper introduces a simple background augmentation method using diffusion models to generate diverse training data, significantly improving object detection performance without extra annotations.
Contribution
It proposes a novel data augmentation technique leveraging text-to-image synthesis for enhancing dataset diversity in object detection tasks.
Findings
Background augmentation improves model robustness and generalization.
The method yields notable performance gains on COCO and other benchmarks.
Adjusting prompts and masks ensures generated content aligns with annotations.
Abstract
In computer vision, it is well-known that a lack of data diversity will impair model performance. In this study, we address the challenges of enhancing the dataset diversity problem in order to benefit various downstream tasks such as object detection and instance segmentation. We propose a simple yet effective data augmentation approach by leveraging advancements in generative models, specifically text-to-image synthesis technologies like Stable Diffusion. Our method focuses on generating variations of labeled real images, utilizing generative object and background augmentation via inpainting to augment existing training data without the need for additional annotations. We find that background augmentation, in particular, significantly improves the models' robustness and generalization capabilities. We also investigate how to adjust the prompt and mask to ensure the generated content…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Face and Expression Recognition · Image Retrieval and Classification Techniques
MethodsDiffusion · Inpainting
