Provable Benefit of Mixup for Finding Optimal Decision Boundaries

Junsoo Oh; Chulhee Yun

arXiv:2306.00267·cs.LG·June 7, 2023·1 cites

Provable Benefit of Mixup for Finding Optimal Decision Boundaries

Junsoo Oh, Chulhee Yun

PDF

Open Access 1 Video

TL;DR

This paper demonstrates that Mixup data augmentation reduces the sample complexity in finding optimal decision boundaries in binary classification, especially for highly separable data, and analyzes its theoretical benefits and limitations.

Contribution

The paper provides a theoretical analysis of Mixup's benefit in reducing sample complexity and introduces new concentration results for pair-wise augmented data.

Findings

01

Mixup mitigates the curse of separability by reducing sample complexity.

02

Vanilla training's sample complexity increases exponentially with data separability.

03

Other masking-based Mixup techniques can distort training loss and lead to suboptimal classifiers.

Abstract

We investigate how pair-wise data augmentation techniques like Mixup affect the sample complexity of finding optimal decision boundaries in a binary linear classification problem. For a family of data distributions with a separability constant $κ$ , we analyze how well the optimal classifier in terms of training loss aligns with the optimal one in test accuracy (i.e., Bayes optimal classifier). For vanilla training without augmentation, we uncover an interesting phenomenon named the curse of separability. As we increase $κ$ to make the data distribution more separable, the sample complexity of vanilla training increases exponentially in $κ$ ; perhaps surprisingly, the task of finding optimal decision boundaries becomes harder for more separable distributions. For Mixup training, we show that Mixup mitigates this problem by significantly reducing the sample complexity. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Provable Benefit of Mixup for Finding Optimal Decision Boundaries· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Imbalanced Data Classification Techniques

MethodsTest · Mixup