Towards Group Robustness in the presence of Partial Group Labels
Vishnu Suresh Lokhande, Kihyuk Sohn, Jinsung Yoon, Madeleine Udell,, Chen-Yu Lee, Tomas Pfister

TL;DR
This paper addresses the challenge of training robust machine learning models with partial group labels by proposing a method that leverages partially available group information to improve minority group performance without sacrificing overall accuracy.
Contribution
The paper introduces a novel approach that constructs a constraint set for group assignments and optimizes for the worst-off groups, filling a gap between fully supervised and unsupervised group robustness methods.
Findings
Improved minority group performance in experiments.
Maintained overall accuracy across groups.
Effective in image and tabular datasets.
Abstract
Learning invariant representations is an important requirement when training machine learning models that are driven by spurious correlations in the datasets. These spurious correlations, between input samples and the target labels, wrongly direct the neural network predictions resulting in poor performance on certain groups, especially the minority groups. Robust training against these spurious correlations requires the knowledge of group membership for every sample. Such a requirement is impractical in situations where the data labeling efforts for minority or rare groups are significantly laborious or where the individuals comprising the dataset choose to conceal sensitive information. On the other hand, the presence of such data collection efforts results in datasets that contain partially labeled group information. Recent works have tackled the fully unsupervised scenario where no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
