G-Mix: A Generalized Mixup Learning Framework Towards Flat Minima
Xingyu Li, Bo Tang

TL;DR
G-Mix is a new training framework that combines Mixup and Sharpness-Aware Minimization to improve the generalization of deep neural networks, especially with limited data, by addressing manifold intrusion and optimizing sharpness sensitivity.
Contribution
The paper introduces G-Mix, a novel framework integrating Mixup and SAM, along with two algorithms, Binary G-Mix and Decomposed G-Mix, to enhance DNN generalization and address manifold intrusion.
Findings
G-Mix improves model generalization across multiple datasets.
Binary G-Mix and Decomposed G-Mix outperform existing methods.
Achieves state-of-the-art performance in experiments.
Abstract
Deep neural networks (DNNs) have demonstrated promising results in various complex tasks. However, current DNNs encounter challenges with over-parameterization, especially when there is limited training data available. To enhance the generalization capability of DNNs, the Mixup technique has gained popularity. Nevertheless, it still produces suboptimal outcomes. Inspired by the successful Sharpness-Aware Minimization (SAM) approach, which establishes a connection between the sharpness of the training loss landscape and model generalization, we propose a new learning framework called Generalized-Mixup, which combines the strengths of Mixup and SAM for training DNN models. The theoretical analysis provided demonstrates how the developed G-Mix framework enhances generalization. Additionally, to further optimize DNN performance with the G-Mix framework, we introduce two novel algorithms:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Human Pose and Action Recognition · Face and Expression Recognition
MethodsSegment Anything Model · Mixup · Sharpness-Aware Minimization
