Mining the Minoria: Unknown, Under-represented, and Under-performing Minority Groups
Mohsen Dehghankar, Abolfazl Asudeh

TL;DR
This paper introduces a geometric approach to identify hidden, under-represented, and under-performing minority groups in data without prior grouping information, addressing a critical challenge in responsible machine learning.
Contribution
It proposes a novel minority mining framework using geometric transformations and hyperplane arrangements, with solutions for both low and high-dimensional data.
Findings
Effective identification of minority groups demonstrated on real-world datasets
Proposed algorithms outperform baseline methods in detecting under-represented minorities
Theoretical analysis supports the robustness of the approach
Abstract
Due to a variety of reasons, such as privacy, data in the wild often misses the grouping information required for identifying minorities. On the other hand, it is known that machine learning models are only as good as the data they are trained on and, hence, may underperform for the under-represented minority groups. The missing grouping information presents a dilemma for responsible data scientists who find themselves in an unknown-unknown situation, where not only do they not have access to the grouping attributes but do not also know what groups to consider. This paper is an attempt to address this dilemma. Specifically, we propose a minority mining problem, where we find vectors in the attribute space that reveal potential groups that are under-represented and under-performing. Technically speaking, we propose a geometric transformation of data into a dual space and use notions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPost-Soviet Geopolitical Dynamics
