Human-annotated rationales and explainable text classification: a survey

Elize Herrewijnen; Dong Nguyen; Floris Bex; Kees van Deemter

PMC · DOI:10.3389/frai.2024.1260952·May 24, 2024

Human-annotated rationales and explainable text classification: a survey

Elize Herrewijnen, Dong Nguyen, Floris Bex, Kees van Deemter

TL;DR

This paper reviews how human explanations for classifications can improve data quality and help build more explainable AI models.

Contribution

The paper provides a survey on the collection and use of human-annotated rationales for explainable text classification.

Findings

01

Human-annotated rationales improve data quality and model performance.

02

They serve as a benchmark for evaluating model-generated explanations.

03

Rationales are crucial for advancing explainable artificial intelligence.

Abstract

Asking annotators to explain “why” they labeled an instance yields annotator rationales: natural language explanations that provide reasons for classifications. In this work, we survey the collection and use of annotator rationales. Human-annotated rationales can improve data quality and form a valuable resource for improving machine learning models. Moreover, human-annotated rationales can inspire the construction and evaluation of model-annotated rationales, which can play an important role in explainable artificial intelligence.

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Figures2

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLiterary and Cultural Studies · Educational theories and practices · Spanish Philosophy and Literature