Image compositing is all you need for data augmentation

Ang Jia Ning Shermaine; Michalis Lazarou; Tania Stathaki

arXiv:2502.13936·cs.CV·February 20, 2025

Image compositing is all you need for data augmentation

Ang Jia Ning Shermaine, Michalis Lazarou, Tania Stathaki

PDF

Open Access

TL;DR

This paper demonstrates that image compositing significantly improves object detection accuracy and robustness, outperforming other augmentation methods like generative models, especially with limited data.

Contribution

It introduces the effectiveness of image compositing as a data augmentation technique for object detection, showing superior performance over traditional and generative methods.

Findings

01

Image compositing yields the highest detection performance improvements.

02

Generative models like Stable Diffusion XL also provide notable gains.

03

Augmentation enhances model robustness and generalization.

Abstract

This paper investigates the impact of various data augmentation techniques on the performance of object detection models. Specifically, we explore classical augmentation methods, image compositing, and advanced generative models such as Stable Diffusion XL and ControlNet. The objective of this work is to enhance model robustness and improve detection accuracy, particularly when working with limited annotated data. Using YOLOv8, we fine-tune the model on a custom dataset consisting of commercial and military aircraft, applying different augmentation strategies. Our experiments show that image compositing offers the highest improvement in detection performance, as measured by precision, recall, and mean Average Precision ([email protected]). Other methods, including Stable Diffusion XL and ControlNet, also demonstrate significant gains, highlighting the potential of advanced data augmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Imaging in Medicine

MethodsDiffusion · You Only Look Once