Improving Object Detector Training on Synthetic Data by Starting With a   Strong Baseline Methodology

Frank A. Ruis; Alma M. Liezenga; Friso G. Heslinga; Luca; Ballan; Thijs A. Eker; Richard J. M. den Hollander; Martin C. van; Leeuwen; Judith Dijk; Wyke Huizinga

arXiv:2405.19822·cs.CV·May 31, 2024

Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology

Frank A. Ruis, Alma M. Liezenga, Friso G. Heslinga, Luca, Ballan, Thijs A. Eker, Richard J. M. den Hollander, Martin C. van, Leeuwen, Judith Dijk, Wyke Huizinga

PDF

Open Access

TL;DR

This paper presents a methodology to enhance object detector training on synthetic data by leveraging a strong baseline, data augmentation, and a Transformer backbone, achieving state-of-the-art results and near-perfect performance.

Contribution

The paper introduces a comprehensive methodology that improves synthetic data training for object detection by combining best practices, data augmentation, and Transformer architectures.

Findings

01

Improved performance on synthetic datasets like RarePlanes and DGTA-VisDrone.

02

Achieved near-perfect detection accuracy on an in-house vehicle dataset.

03

Enhanced baseline models with data augmentation and Transformer backbones.

Abstract

Collecting and annotating real-world data for the development of object detection models is a time-consuming and expensive process. In the military domain in particular, data collection can also be dangerous or infeasible. Training models on synthetic data may provide a solution for cases where access to real-world training data is restricted. However, bridging the reality gap between synthetic and real data remains a challenge. Existing methods usually build on top of baseline Convolutional Neural Network (CNN) models that have been shown to perform well when trained on real data, but have limited ability to perform well when trained on synthetic data. For example, some architectures allow for fine-tuning with the expectation of large quantities of training data and are prone to overfitting on synthetic data. Related work usually ignores various best practices from object detection on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Neural Networks and Applications · Machine Learning and Algorithms

MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections