Outlier-Robust Group Inference via Gradient Space Clustering
Yuchen Zeng, Kristjan Greenewald, Kangwook Lee, Justin Solomon,, Mikhail Yurochkin

TL;DR
This paper introduces a gradient space clustering method to identify minority groups and outliers without needing explicit group annotations, significantly improving worst-group performance in machine learning models.
Contribution
The paper proposes a novel approach that clusters data in gradient space to simultaneously handle outliers and identify minority groups without requiring group labels.
Findings
Outperforms state-of-the-art in group identification
Improves worst-group performance substantially
Effective in the presence of outliers
Abstract
Traditional machine learning models focus on achieving good performance on the overall training distribution, but they often underperform on minority groups. Existing methods can improve the worst-group performance, but they can have several limitations: (i) they require group annotations, which are often expensive and sometimes infeasible to obtain, and/or (ii) they are sensitive to outliers. Most related works fail to solve these two issues simultaneously as they focus on conflicting perspectives of minority groups and outliers. We address the problem of learning group annotations in the presence of outliers by clustering the data in the space of gradients of the model parameters. We show that data in the gradient space has a simpler structure while preserving information about minority groups and outliers, making it suitable for standard clustering methods like DBSCAN. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · COVID-19 diagnosis using AI · Imbalanced Data Classification Techniques
