TIDE: Textual Identity Detection for Evaluating and Augmenting Classification and Language Models

Emmanuel Klu; Sameer Sethi

arXiv:2309.04027 · cs.CL · January 15, 2024

Open Access

TL;DR

This paper introduces TIDAL, a comprehensive identity lexicon, together with an identity annotation and augmentation tool built on it. Both are used to evaluate and improve fairness in text classifiers and language models when sensitive attributes are not otherwise available.

Contribution

The paper presents TIDAL, a new identity lexicon with 15,123 terms, and an annotation and augmentation approach that supports fairness evaluation and bias mitigation in NLP models.

Findings

01. Assistive annotation improves human-in-the-loop efficiency.

02. The proposed methods uncover more disparities in datasets and models.

03. The approaches lead to fairer models during remediation.

Abstract

Machine learning models can perpetuate unintended biases from unfair and imbalanced datasets. Evaluating and debiasing these datasets and models is especially hard in text datasets where sensitive attributes such as race, gender, and sexual orientation may not be available. When these models are deployed into society, they can lead to unfair outcomes for historically underrepresented groups. In this paper, we present a dataset coupled with an approach to improve text fairness in classifiers and language models. We create a new, more comprehensive identity lexicon, TIDAL, which includes 15,123 identity terms and associated sense context across three demographic categories. We leverage TIDAL to develop an identity annotation and augmentation tool that can be used to improve the availability of identity context and the effectiveness of ML fairness techniques. We evaluate our approaches…
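To make the idea of lexicon-based identity annotation concrete, here is a minimal sketch in Python. It is not the authors' tool. The toy lexicon, its category labels (borrowed from the sensitive attributes the abstract names as examples), and the names `TOY_LEXICON`, `IdentityMatch`, and `annotate` are all hypothetical. It assumes plain whole-word, case-insensitive matching and does not use the sense context that TIDAL associates with its terms.

```python
import re
from dataclasses import dataclass

# Hypothetical toy lexicon mapping a term to an illustrative category label.
# These entries and labels are assumptions, not TIDAL's actual contents or schema.
TOY_LEXICON = {
    "asian american": "race",
    "women": "gender",
    "gay": "sexual orientation",
}


@dataclass(frozen=True)
class IdentityMatch:
    term: str       # lexicon entry that matched
    category: str   # illustrative category label
    start: int      # character offset where the match begins
    end: int        # character offset just past the match


def annotate(text, lexicon=TOY_LEXICON):
    """Return lexicon terms found in text, ordered by position.

    Matching is whole-word and case-insensitive. No word-sense
    disambiguation is attempted, so ambiguous terms may be over-matched.
    """
    matches = []
    for term, category in lexicon.items():
        pattern = r"\b" + re.escape(term) + r"\b"
        for m in re.finditer(pattern, text, flags=re.IGNORECASE):
            matches.append(IdentityMatch(term, category, m.start(), m.end()))
    return sorted(matches, key=lambda match: match.start)


if __name__ == "__main__":
    for match in annotate("Asian American women spoke at the event."):
        print(match)
```

Annotations of this kind can supply the identity context that is otherwise missing from a text dataset, for example to slice evaluation metrics by category. A practical tool would also need to handle ambiguous terms, which this sketch deliberately ignores.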

Taxonomy

Topics: Hate Speech and Cyberbullying Detection · Ethics and Social Impacts of AI · Topic Modeling