Probing Natural Language Inference Models through Semantic Fragments

Kyle Richardson; Hai Hu; Lawrence S. Moss; Ashish Sabharwal

arXiv:1909.07521·cs.CL·December 3, 2019

Probing Natural Language Inference Models through Semantic Fragments

Kyle Richardson, Hai Hu, Lawrence S. Moss, Ashish Sabharwal

PDF

3 Repos 1 Datasets

TL;DR

This paper introduces semantic fragments as targeted datasets to probe and improve natural language inference models' understanding of complex semantic phenomena, revealing their limitations and potential for rapid fine-tuning.

Contribution

It presents a systematic approach to creating challenge datasets for semantic phenomena, enabling precise evaluation and enhancement of NLI models' linguistic capabilities.

Findings

01

State-of-the-art models perform poorly on semantic fragments

02

Few-minute fine-tuning can enable models to master these phenomena

03

Models retain performance on standard benchmarks after fine-tuning

Abstract

Do state-of-the-art models for language understanding already have, or can they easily learn, abilities such as boolean coordination, quantification, conditionals, comparatives, and monotonicity reasoning (i.e., reasoning about word substitutions in sentential contexts)? While such phenomena are involved in natural language inference (NLI) and go beyond basic linguistic understanding, it is unclear the extent to which they are captured in existing NLI benchmarks and effectively learned by models. To investigate this, we propose the use of semantic fragments---systematically generated datasets that each target a different semantic phenomenon---for probing, and efficiently improving, such capabilities of linguistic models. This approach to creating challenge datasets allows direct control over the semantic diversity and complexity of the targeted linguistic phenomena, and results in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

tasksource/semantic_fragments_nli
dataset· 11 dl
11 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Weight Decay · Residual Connection · Adam · Layer Normalization · Softmax · Attention Is All You Need · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention