SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models
Haozhe An, Zongxia Li, Jieyu Zhao, Rachel Rudinger

TL;DR
SODAPOP is a novel method that uncovers hidden social biases in NLP models by generating diverse test instances through demographic substitutions and distractor answers, revealing biases not pre-specified by designers.
Contribution
The paper introduces SODAPOP, a new open-ended approach for discovering social biases in social commonsense reasoning models that overcomes limitations of traditional diagnostic tests.
Findings
SODAPOP uncovers biases in models that are missed by existing tests.
State-of-the-art debiasing methods have limited effectiveness.
The method reveals stereotypic associations between demographic groups and words.
Abstract
A common limitation of diagnostic tests for detecting social biases in NLP models is that they may only detect stereotypic associations that are pre-specified by the designer of the test. Since enumerating all possible problematic associations is infeasible, it is likely these tests fail to detect biases that are present in a model but not pre-specified by the designer. To address this limitation, we propose SODAPOP (SOcial bias Discovery from Answers about PeOPle) in social commonsense question-answering. Our pipeline generates modified instances from the Social IQa dataset (Sap et al., 2019) by (1) substituting names associated with different demographic groups, and (2) generating many distractor answers from a masked language model. By using a social commonsense model to score the generated distractors, we are able to uncover the model's stereotypic associations between demographic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsTest
