Visual Data Augmentation through Learning
Grigorios G. Chrysos, Yannis Panagakis, Stefanos Zafeiriou

TL;DR
This paper introduces a novel data augmentation method that learns realistic local transformations in images, enabling the synthesis of new, plausible images to improve machine learning models.
Contribution
The paper proposes a learning-based augmentation technique that models local transformations in images, producing more natural augmented data compared to traditional methods.
Findings
Generated images are realistic and diverse.
The method improves model invariance and robustness.
Qualitative and quantitative results validate effectiveness.
Abstract
The rapid progress in machine learning methods has been empowered by i) huge datasets that have been collected and annotated, ii) improved engineering (e.g. data pre-processing/normalization). The existing datasets typically include several million samples, which constitutes their extension a colossal task. In addition, the state-of-the-art data-driven methods demand a vast amount of data, hence a standard engineering trick employed is artificial data augmentation for instance by adding into the data cropped and (affinely) transformed images. However, this approach does not correspond to any change in the natural 3D scene. We propose instead to perform data augmentation through learning realistic local transformations. We learn a forward and an inverse transformation that maps an image from the high-dimensional space of pixel intensities to a latent space which varies (approximately)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Advanced Image Processing Techniques
