Centering the Margins: Outlier-Based Identification of Harmed Populations in Toxicity Detection
Vyoma Raman, Eve Fleisig, Dan Klein

TL;DR
This paper introduces an outlier-based method to identify marginalized groups in toxicity detection, revealing that models perform worse on these groups and that this approach uncovers broader and more intersectional harms than traditional demographic analysis.
Contribution
It operationalizes the concept of 'margins' using outlier detection to better identify vulnerable populations and their specific harms in toxicity detection models.
Findings
Model performance is up to 70.4% worse for demographic outliers.
Text outliers experience up to 68.4% higher MSE in toxicity detection.
Outlier analysis reveals broader and more intersectional harms than traditional methods.
Abstract
The impact of AI models on marginalized communities has traditionally been measured by identifying performance differences between specified demographic subgroups. Though this approach aims to center vulnerable groups, it risks obscuring patterns of harm faced by intersectional subgroups or shared across multiple groups. To address this, we draw on theories of marginalization from disability studies and related disciplines, which state that people farther from the norm face greater adversity, to consider the "margins" in the domain of toxicity detection. We operationalize the "margins" of a dataset by employing outlier detection to identify text about people with demographic attributes distant from the "norm". We find that model performance is consistently worse for demographic outliers, with mean squared error (MSE) between outliers and non-outliers up to 70.4% worse across toxicity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOccupational Health and Safety Research
