Teach me how to Interpolate a Myriad of Embeddings
Shashanka Venkataramanan, Ewa Kijak, Laurent Amsaleg, Yannis Avrithis

TL;DR
This paper introduces MultiMix, a novel interpolation method that combines multiple embeddings at once, improving data augmentation and class clustering in embedding space, leading to better performance on benchmarks.
Contribution
The paper proposes MultiMix, allowing interpolation of multiple tuples with individual vectors, dense spatial interpolation for sequence data, and a self-distillation approach for consistent target generation.
Findings
Significant performance improvements over state-of-the-art mixup methods.
Classes are more tightly clustered and uniformly spread in embedding space.
Enhanced data augmentation efficiency with minimal additional cost.
Abstract
Mixup refers to interpolation-based data augmentation, originally motivated as a way to go beyond empirical risk minimization (ERM). Yet, its extensions focus on the definition of interpolation and the space where it takes place, while the augmentation itself is less studied: For a mini-batch of size , most methods interpolate between pairs with a single scalar interpolation factor . In this work, we make progress in this direction by introducing MultiMix, which interpolates an arbitrary number of tuples, each of length , with one vector per tuple. On sequence data, we further extend to dense interpolation and loss computation over all spatial positions. Overall, we increase the number of tuples per mini-batch by orders of magnitude at little additional cost. This is possible by interpolating at the very last layer before the classifier. Finally, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
MethodsMixup
