Investigating the Effect of Natural Language Explanations on   Out-of-Distribution Generalization in Few-shot NLI

Yangqiaoyu Zhou; Chenhao Tan

arXiv:2110.06223·cs.CL·October 14, 2021

Investigating the Effect of Natural Language Explanations on Out-of-Distribution Generalization in Few-shot NLI

Yangqiaoyu Zhou, Chenhao Tan

PDF

Open Access 1 Repo

TL;DR

This paper investigates whether natural language explanations can improve out-of-distribution generalization in few-shot NLI models, finding that generated explanations often hallucinate and do not enhance prediction accuracy.

Contribution

The study introduces a framework for testing natural language explanations' impact on OOD generalization in few-shot NLI, revealing limitations of current explanation generation methods.

Findings

01

Generated explanations often hallucinate information.

02

Explanations fail to improve OOD prediction performance.

03

BLEU scores of explanations are competitive with groundtruth, but do not correlate with accuracy.

Abstract

Although neural models have shown strong performance in datasets such as SNLI, they lack the ability to generalize out-of-distribution (OOD). In this work, we formulate a few-shot learning setup and examine the effects of natural language explanations on OOD generalization. We leverage the templates in the HANS dataset and construct templated natural language explanations for each template. Although generated explanations show competitive BLEU scores against groundtruth explanations, they fail to improve prediction performance. We further show that generated explanations often hallucinate information and miss key elements that indicate the label.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chicagohai/hans-explanations
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications