DISCERN: Decoding Systematic Errors in Natural Language for Text   Classifiers

Rakesh R. Menon; Shashank Srivastava

arXiv:2410.22239·cs.CL·October 30, 2024

DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers

Rakesh R. Menon, Shashank Srivastava

PDF

Open Access 1 Repo 1 Video

TL;DR

DISCERN is a framework that uses language explanations generated by large language models to interpret, identify, and mitigate systematic biases in text classifiers, leading to improved performance and better human interpretability.

Contribution

We introduce DISCERN, a novel method that employs language explanations and an interactive loop between language models to interpret and reduce systematic biases in text classifiers.

Findings

01

Language explanations improve classifier performance beyond bias exemplars.

02

Users interpret systematic biases more effectively with language explanations.

03

Our framework achieves consistent performance gains across multiple datasets.

Abstract

Despite their high predictive accuracies, current machine learning systems often exhibit systematic biases stemming from annotation artifacts or insufficient support for certain classes in the dataset. Recent work proposes automatic methods for identifying and explaining systematic biases using keywords. We introduce DISCERN, a framework for interpreting systematic biases in text classifiers using language explanations. DISCERN iteratively generates precise natural language descriptions of systematic errors by employing an interactive loop between two large language models. Finally, we use the descriptions to improve classifiers by augmenting classifier training sets with synthetically generated instances or annotated examples via active learning. On three text-classification datasets, we demonstrate that language explanations from our framework induce consistent performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rrmenon10/DISCERN
noneOfficial

Videos

DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling