ADA: A Game-Theoretic Perspective on Data Augmentation for Object Detection
Sima Behpour, Kris M. Kitani, Brian D. Ziebart

TL;DR
This paper introduces a game-theoretic framework for data augmentation in object detection, deriving an optimal adversarial augmentation strategy that enhances model robustness and significantly improves test performance.
Contribution
It develops a novel game-theoretic approach to find optimal adversarial data augmentations, providing a principled method for improving object detection models.
Findings
Adversarial augmentation improves detection accuracy by over 16%.
The Nash equilibrium offers an optimal predictor and augmentation distribution.
The approach outperforms existing data augmentation methods.
Abstract
The use of random perturbations of ground truth data, such as random translation or scaling of bounding boxes, is a common heuristic used for data augmentation that has been shown to prevent overfitting and improve generalization. Since the design of data augmentation is largely guided by reported best practices, it is difficult to understand if those design choices are optimal. To provide a more principled perspective, we develop a game-theoretic interpretation of data augmentation in the context of object detection. We aim to find an optimal adversarial perturbations of the ground truth data (i.e., the worst case perturbations) that forces the object bounding box predictor to learn from the hardest distribution of perturbed examples for better test-time performance. We establish that the game theoretic solution, the Nash equilibrium, provides both an optimal predictor and optimal data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
