How does overparametrization affect performance on minority groups?
Subha Maity, Saptarshi Roy, Songkai Xue, Mikhail Yurochkin, Yuekai Sun

TL;DR
This paper investigates how overparameterization influences the performance of machine learning models on minority groups, combining empirical evidence with theoretical analysis to show consistent improvements for minority groups.
Contribution
It provides a theoretical understanding of overparameterization's positive effect on minority group performance in ML models, complementing existing empirical findings.
Findings
Overparameterization improves minority group performance.
Empirical results show better worst-group accuracy with overparameterized models.
Theoretical analysis confirms consistent minority group benefits in certain settings.
Abstract
The benefits of overparameterization for the overall performance of modern machine learning (ML) models are well known. However, the effect of overparameterization at a more granular level of data subgroups is less understood. Recent empirical studies demonstrate encouraging results: (i) when groups are not known, overparameterized models trained with empirical risk minimization (ERM) perform better on minority groups; (ii) when groups are known, ERM on data subsampled to equalize group sizes yields state-of-the-art worst-group-accuracy in the overparameterized regime. In this paper, we complement these empirical studies with a theoretical investigation of the risk of overparameterized random feature models on minority groups. In a setting in which the regression functions for the majority and minority groups are different, we show that overparameterization always improves minority…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Recommender Systems and Techniques
