FlatMatch: Bridging Labeled Data and Unlabeled Data with Cross-Sharpness for Semi-Supervised Learning
Zhuo Huang, Li Shen, Jun Yu, Bo Han, Tongliang Liu

TL;DR
FlatMatch introduces a novel cross-sharpness measure to improve semi-supervised learning by aligning the learning process on labeled and unlabeled data, leading to better generalization and state-of-the-art results.
Contribution
The paper proposes FlatMatch, a method that minimizes cross-sharpness to effectively propagate label guidance and enhance semi-supervised learning performance.
Findings
Achieves state-of-the-art results in various SSL benchmarks.
Effectively mitigates mismatched learning performance between labeled and unlabeled data.
Improves generalization by calibrating learning with worst-case risk and cross-sharpness.
Abstract
Semi-Supervised Learning (SSL) has been an effective way to leverage abundant unlabeled data with extremely scarce labeled data. However, most SSL methods are commonly based on instance-wise consistency between different data transformations. Therefore, the label guidance on labeled data is hard to be propagated to unlabeled data. Consequently, the learning process on labeled data is much faster than on unlabeled data which is likely to fall into a local minima that does not favor unlabeled data, leading to sub-optimal generalization performance. In this paper, we propose FlatMatch which minimizes a cross-sharpness measure to ensure consistent learning performance between the two datasets. Specifically, we increase the empirical risk on labeled data to obtain a worst-case model which is a failure case that needs to be enhanced. Then, by leveraging the richness of unlabeled data, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Infrastructure Maintenance and Monitoring
