RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of   Conversational Language Models

Soumya Barikeri; Anne Lauscher; Ivan Vuli\'c; and Goran Glava\v{s}

arXiv:2106.03521·cs.CL·June 8, 2021

RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models

Soumya Barikeri, Anne Lauscher, Ivan Vuli\'c, and Goran Glava\v{s}

PDF

1 Repo

TL;DR

RedditBias provides a new dataset and evaluation framework for measuring and mitigating societal biases in conversational language models, focusing on real Reddit conversations and multiple bias dimensions.

Contribution

This work introduces RedditBias, the first real-world conversational bias dataset, and an evaluation framework that assesses bias mitigation effects on dialog tasks.

Findings

01

DialoGPT exhibits bias towards religious groups.

02

Some debiasing methods effectively reduce bias.

03

Debiasing can preserve model performance in dialog tasks.

Abstract

Text representation models are prone to exhibit a range of societal biases, reflecting the non-controlled and biased nature of the underlying pretraining data, which consequently leads to severe ethical issues and even bias amplification. Recent work has predominantly focused on measuring and mitigating bias in pretrained language models. Surprisingly, the landscape of bias measurements and mitigation resources and methods for conversational language models is still very scarce: it is limited to only a few types of bias, artificially constructed resources, and completely ignores the impact that debiasing methods may have on the final performance in dialog tasks, e.g., conversational response generation. In this work, we present RedditBias, the first conversational data set grounded in the actual human conversations from Reddit, allowing for bias measurement and mitigation across four…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

umanlp/RedditBias
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.