Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology
Frank A. Ruis, Alma M. Liezenga, Friso G. Heslinga, Luca, Ballan, Thijs A. Eker, Richard J. M. den Hollander, Martin C. van, Leeuwen, Judith Dijk, Wyke Huizinga

TL;DR
This paper presents a methodology to enhance object detector training on synthetic data by leveraging a strong baseline, data augmentation, and a Transformer backbone, achieving state-of-the-art results and near-perfect performance.
Contribution
The paper introduces a comprehensive methodology that improves synthetic data training for object detection by combining best practices, data augmentation, and Transformer architectures.
Findings
Improved performance on synthetic datasets like RarePlanes and DGTA-VisDrone.
Achieved near-perfect detection accuracy on an in-house vehicle dataset.
Enhanced baseline models with data augmentation and Transformer backbones.
Abstract
Collecting and annotating real-world data for the development of object detection models is a time-consuming and expensive process. In the military domain in particular, data collection can also be dangerous or infeasible. Training models on synthetic data may provide a solution for cases where access to real-world training data is restricted. However, bridging the reality gap between synthetic and real data remains a challenge. Existing methods usually build on top of baseline Convolutional Neural Network (CNN) models that have been shown to perform well when trained on real data, but have limited ability to perform well when trained on synthetic data. For example, some architectures allow for fine-tuning with the expectation of large quantities of training data and are prone to overfitting on synthetic data. Related work usually ignores various best practices from object detection on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Neural Networks and Applications · Machine Learning and Algorithms
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections
