Are Hard Examples also Harder to Explain? A Study with Human and   Model-Generated Explanations

Swarnadeep Saha; Peter Hase; Nazneen Rajani; Mohit Bansal

arXiv:2211.07517·cs.CL·November 15, 2022·1 cites

Are Hard Examples also Harder to Explain? A Study with Human and Model-Generated Explanations

Swarnadeep Saha, Peter Hase, Nazneen Rajani, Mohit Bansal

PDF

Open Access 1 Repo

TL;DR

This study compares human and GPT-3 explanations for data labels across easy and hard samples in the Winograd Schema Challenge, revealing that humans outperform models on hard examples in supportiveness and generalizability.

Contribution

It provides a detailed analysis of how explanation quality varies with sample hardness for both humans and GPT-3, highlighting the limitations of current LLM explanations on difficult samples.

Findings

01

GPT-3 explanations are as grammatical as human explanations regardless of sample hardness.

02

Humans provide more generalizable explanations for easy samples, while GPT-3 excels in supportiveness.

03

Humans outperform GPT-3 in explanation quality on hard samples.

Abstract

Recent work on explainable NLP has shown that few-shot prompting can enable large pretrained language models (LLMs) to generate grammatical and factual natural language explanations for data labels. In this work, we study the connection between explainability and sample hardness by investigating the following research question - "Are LLMs and humans equally good at explaining data labels for both easy and hard samples?" We answer this question by first collecting human-written explanations in the form of generalizable commonsense rules on the task of Winograd Schema Challenge (Winogrande dataset). We compare these explanations with those generated by GPT-3 while varying the hardness of the test samples as well as the in-context samples. We observe that (1) GPT-3 explanations are as grammatical as human explanations regardless of the hardness of the test samples, (2) for easy examples,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

swarnahub/explanationhardness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Cosine Annealing · Refunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Weight Decay · Softmax · Adam