Provable Weak-to-Strong Generalization via Benign Overfitting
David X. Wu, Anant Sahai

TL;DR
This paper investigates how a strong student model can achieve successful generalization even when supervised by a weak teacher providing pseudolabels that are nearly random, revealing two distinct phases of learning.
Contribution
It provides a theoretical analysis of weak-to-strong generalization in overparameterized models with Gaussian data, identifying conditions for successful learning despite weak supervision.
Findings
Identifies two asymptotic phases: successful generalization and random guessing.
Proves a tight lower tail inequality for the maximum of correlated Gaussians.
Highlights the importance of logits in multilabel weak supervision.
Abstract
The classic teacher-student model in machine learning posits that a strong teacher supervises a weak student to improve the student's capabilities. We instead consider the inverted situation, where a weak teacher supervises a strong student with imperfect pseudolabels. This paradigm was recently brought forth by Burns et al.'23 and termed \emph{weak-to-strong generalization}. We theoretically investigate weak-to-strong generalization for binary and multilabel classification in a stylized overparameterized spiked covariance model with Gaussian covariates where the weak teacher's pseudolabels are asymptotically like random guessing. Under these assumptions, we provably identify two asymptotic phases of the strong student's generalization after weak supervision: (1) successful generalization and (2) random guessing. Our techniques should eventually extend to weak-to-strong multiclass…
Peer Reviews
Decision·ICLR 2025 Poster
The math appears correct to me; the problem is significant, and desiderata 1 and desiderata 2 make sense.
The paper is rather technical, and the clarity could be improved significantly to make it more readable. (see questions)
Exact characterization of the regime where weak to strong generalization occurs in terms of parameters of the covariance matrix of strong and weak features.
Most of the important details are pushed into appendix. The main body only contains one useful theorem which identifies a certain condition where condition 1) of weak to strong generalization holds. Setting for condition 2) and multi class settings are merely mentioned as claims. The main body also does not provide proof sketch or provide insights into the proof of the theorem.
1) This work addresses the important problem of obtaining theoretical justification for a frequently encountered empirical phenomenon 2) The lower tail for max of correlated gaussians is an interesting result.
See questions.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Advanced Vision and Imaging · Medical Image Segmentation Techniques
