FMix: Enhancing Mixed Sample Data Augmentation
Ethan Harris, Antonia Marcu, Matthew Painter, Mahesan Niranjan, Adam, Pr\"ugel-Bennett, Jonathon Hare

TL;DR
FMix introduces a novel mixed sample data augmentation technique using random Fourier masks, improving model performance and robustness by better preserving data distribution and preventing memorization.
Contribution
This paper proposes FMix, a new MSDA method using Fourier-based masks that enhances performance and combines well with existing methods like MixUp.
Findings
FMix outperforms MixUp and CutMix on various datasets.
FMix achieves state-of-the-art results on CIFAR-10.
Combining FMix with MixUp further improves accuracy.
Abstract
Mixed Sample Data Augmentation (MSDA) has received increasing attention in recent years, with many successful variants such as MixUp and CutMix. By studying the mutual information between the function learned by a VAE on the original data and on the augmented data we show that MixUp distorts learned functions in a way that CutMix does not. We further demonstrate this by showing that MixUp acts as a form of adversarial training, increasing robustness to attacks such as Deep Fool and Uniform Noise which produce examples similar to those generated by MixUp. We argue that this distortion prevents models from learning about sample specific features in the data, aiding generalisation performance. In contrast, we suggest that CutMix works more like a traditional augmentation, improving performance by preventing memorisation without distorting the data distribution. However, we argue that an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
MethodsFMix · Average Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization
