Diverse Prototypical Ensembles Improve Robustness to Subpopulation Shift
Minh Nguyen Nhat To, Paul F RWilson, Viet Nguyen, Mohamed Harmanani, Michael Cooper, Fahimeh Fooladgar, Purang Abolmaesumi, Parvin Mousavi, Rahul G. Krishnan

TL;DR
This paper introduces Diverse Prototypical Ensembles (DPEs), a novel ensemble method that improves robustness to subpopulation shifts without requiring group annotations, outperforming existing methods on multiple real-world datasets.
Contribution
The paper proposes DPEs, an ensemble of diverse classifiers with prototypical layers, to adaptively handle subpopulation shifts without relying on group labels or assumptions.
Findings
DPEs outperform state-of-the-art methods in worst-group accuracy.
DPEs demonstrate robustness across nine diverse real-world datasets.
The approach does not require group annotations or prior assumptions.
Abstract
The subpopulationtion shift, characterized by a disparity in subpopulation distributibetween theween the training and target datasets, can significantly degrade the performance of machine learning models. Current solutions to subpopulation shift involve modifying empirical risk minimization with re-weighting strategies to improve generalization. This strategy relies on assumptions about the number and nature of subpopulations and annotations on group membership, which are unavailable for many real-world datasets. Instead, we propose using an ensemble of diverse classifiers to adaptively capture risk associated with subpopulations. Given a feature extractor network, we replace its standard linear classification layer with a mixture of prototypical classifiers, where each member is trained to classify the data while focusing on different features and samples from other members. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Imbalanced Data Classification Techniques · Machine Learning in Healthcare
