On Weak-to-Strong Generalization and f-Divergence
Wei Yao, Gengze Xu, Huayi Tang, Wenkai Yang, Donglin Di, Ziqiao Wang, Yong Liu

TL;DR
This paper introduces the use of $f$-divergence as a loss function in weak-to-strong generalization, providing theoretical insights and empirical evidence of its effectiveness in improving model generalization and noise robustness.
Contribution
It proposes an $f$-divergence framework for W2SG, analyzes its theoretical properties, and demonstrates practical benefits over existing methods.
Findings
$f$-divergence loss improves generalization performance.
Theoretical analysis reveals fundamental limitations and equivalences.
Empirical results show enhanced noise tolerance in models.
Abstract
Weak-to-strong generalization (W2SG) has emerged as a promising paradigm for stimulating the capabilities of strong pre-trained models by leveraging supervision from weaker supervisors. To improve the performance of the strong model, existing methods often require additional weak models or complex procedures, leading to substantial computational and memory overhead. Motivated by the effectiveness of -divergence loss in various machine learning domains, we introduce -divergence as an information-theoretic loss function framework in W2SG. Our theoretical analysis reveals fundamental limitations and equivalence of different -divergence losses in W2SG, supported by sample complexity bounds and information-theoretic insights. We empirically demonstrate that -divergence loss, which generalizes widely-used metrics like KL divergence, effectively improves generalization and noise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsApproximation Theory and Sequence Spaces · Optimization and Variational Analysis · Fuzzy and Soft Set Theory
