Provable Weak-to-Strong Generalization via Benign Overfitting

David X. Wu; Anant Sahai

arXiv:2410.04638·cs.LG·February 3, 2025

Provable Weak-to-Strong Generalization via Benign Overfitting

David X. Wu, Anant Sahai

PDF

Open Access 3 Reviews

TL;DR

This paper investigates how a strong student model can achieve successful generalization even when supervised by a weak teacher providing pseudolabels that are nearly random, revealing two distinct phases of learning.

Contribution

It provides a theoretical analysis of weak-to-strong generalization in overparameterized models with Gaussian data, identifying conditions for successful learning despite weak supervision.

Findings

01

Identifies two asymptotic phases: successful generalization and random guessing.

02

Proves a tight lower tail inequality for the maximum of correlated Gaussians.

03

Highlights the importance of logits in multilabel weak supervision.

Abstract

The classic teacher-student model in machine learning posits that a strong teacher supervises a weak student to improve the student's capabilities. We instead consider the inverted situation, where a weak teacher supervises a strong student with imperfect pseudolabels. This paradigm was recently brought forth by Burns et al.'23 and termed \emph{weak-to-strong generalization}. We theoretically investigate weak-to-strong generalization for binary and multilabel classification in a stylized overparameterized spiked covariance model with Gaussian covariates where the weak teacher's pseudolabels are asymptotically like random guessing. Under these assumptions, we provably identify two asymptotic phases of the strong student's generalization after weak supervision: (1) successful generalization and (2) random guessing. Our techniques should eventually extend to weak-to-strong multiclass…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 6Confidence 3

Strengths

The math appears correct to me; the problem is significant, and desiderata 1 and desiderata 2 make sense.

Weaknesses

The paper is rather technical, and the clarity could be improved significantly to make it more readable. (see questions)

Reviewer 02Rating 6Confidence 3

Strengths

Exact characterization of the regime where weak to strong generalization occurs in terms of parameters of the covariance matrix of strong and weak features.

Weaknesses

Most of the important details are pushed into appendix. The main body only contains one useful theorem which identifies a certain condition where condition 1) of weak to strong generalization holds. Setting for condition 2) and multi class settings are merely mentioned as claims. The main body also does not provide proof sketch or provide insights into the proof of the theorem.

Reviewer 03Rating 8Confidence 2

Strengths

1) This work addresses the important problem of obtaining theoretical justification for a frequently encountered empirical phenomenon 2) The lower tail for max of correlated gaussians is an interesting result.

Weaknesses

See questions.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods · Advanced Vision and Imaging · Medical Image Segmentation Techniques