Competency Problems: On Finding and Removing Artifacts in Language Data

Matt Gardner; William Merrill; Jesse Dodge; Matthew E. Peters; Alexis; Ross; Sameer Singh; Noah A. Smith

arXiv:2104.08646·cs.CL·December 30, 2021

Competency Problems: On Finding and Removing Artifacts in Language Data

Matt Gardner, William Merrill, Jesse Dodge, Matthew E. Peters, Alexis, Ross, Sameer Singh, Noah A. Smith

PDF

Open Access

TL;DR

This paper introduces the concept of competency problems in NLP, emphasizing that simple feature correlations are often spurious and proposing methods to identify and mitigate dataset artifacts and biases.

Contribution

It formalizes competency problems, provides a statistical test for dataset artifacts, and analyzes the difficulty of creating unbiased datasets for complex language understanding.

Findings

01

Models are affected by subtle dataset biases

02

Realistic datasets deviate from true competency as they grow

03

Proposed statistical test detects spurious correlations

Abstract

Much recent work in NLP has documented dataset artifacts, bias, and spurious correlations between input features and output labels. However, how to tell which features have "spurious" instead of legitimate correlations is typically left unspecified. In this work we argue that for complex language understanding tasks, all simple feature correlations are spurious, and we formalize this notion into a class of problems which we call competency problems. For example, the word "amazing" on its own should not give information about a sentiment label independent of the context in which it appears, which could include negation, metaphor, sarcasm, etc. We theoretically analyze the difficulty of creating data for competency problems when human bias is taken into account, showing that realistic datasets will increasingly deviate from competency problems as dataset size increases. This analysis…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)