TIDE: Textual Identity Detection for Evaluating and Augmenting Classification and Language Models
Emmanuel Klu, Sameer Sethi

TL;DR
This paper introduces TIDAL, a comprehensive identity lexicon and an annotation tool to evaluate and improve fairness in text classifiers and language models, addressing biases related to sensitive attributes.
Contribution
The paper presents TIDAL, a new identity lexicon with 15,123 terms, and an annotation and augmentation approach to enhance fairness evaluation and mitigation in NLP models.
Findings
Assistive annotation improves human-in-the-loop efficiency.
Methods uncover more disparities in datasets and models.
Approaches lead to fairer models during remediation.
Abstract
Machine learning models can perpetuate unintended biases from unfair and imbalanced datasets. Evaluating and debiasing these datasets and models is especially hard in text datasets where sensitive attributes such as race, gender, and sexual orientation may not be available. When these models are deployed into society, they can lead to unfair outcomes for historically underrepresented groups. In this paper, we present a dataset coupled with an approach to improve text fairness in classifiers and language models. We create a new, more comprehensive identity lexicon, TIDAL, which includes 15,123 identity terms and associated sense context across three demographic categories. We leverage TIDAL to develop an identity annotation and augmentation tool that can be used to improve the availability of identity context and the effectiveness of ML fairness techniques. We evaluate our approaches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Ethics and Social Impacts of AI · Topic Modeling
