Statistically Profiling Biases in Natural Language Reasoning Datasets   and Models

Shanshan Huang; Kenny Q. Zhu

arXiv:2102.04632·cs.CL·February 10, 2021

Statistically Profiling Biases in Natural Language Reasoning Datasets and Models

Shanshan Huang, Kenny Q. Zhu

PDF

Open Access

TL;DR

This paper introduces ICQ, a lightweight framework that automatically detects biases in natural language reasoning datasets and assesses how models exploit these biases, addressing limitations of manual stress tests.

Contribution

The paper presents ICQ, a novel statistical profiling method that identifies biases in datasets and evaluates model reliance without additional test case creation.

Findings

01

ICQ effectively detects biases in multiple-choice NLU datasets.

02

Models tend to exploit identified biases, overestimating their reasoning capabilities.

03

ICQ provides a scalable, general approach for bias detection in NLP datasets.

Abstract

Recent work has indicated that many natural language understanding and reasoning datasets contain statistical cues that may be taken advantaged of by NLP models whose capability may thus be grossly overestimated. To discover the potential weakness in the models, some human-designed stress tests have been proposed but they are expensive to create and do not generalize to arbitrary models. We propose a light-weight and general statistical profiling framework, ICQ (I-See-Cue), which automatically identifies possible biases in any multiple-choice NLU datasets without the need to create any additional test cases, and further evaluates through blackbox testing the extent to which models may exploit these biases.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems