Language Models for Lexical Inference in Context

Martin Schmitt; Hinrich Schütze

arXiv:2102.05331 · cs.CL · April 28, 2021

TL;DR

This paper explores pretrained language models for lexical inference in context (LIiC), introducing three approaches that outperform the previous state of the art at recognizing entailment between two sentences that differ in only one expression.

Contribution

The paper presents the first LIiC methods based on pretrained LMs: a few-shot NLI classifier, a relation induction approach using handcrafted patterns, and a variant of the latter using patterns automatically extracted from a corpus.

Findings

01. All proposed methods outperform the previous state of the art.

02. Pretrained LMs show strong potential for lexical inference tasks.

03. The analysis identifies factors that influence the success and failure of the approaches.

Abstract

Lexical inference in context (LIiC) is the task of recognizing textual entailment between two very similar sentences, i.e., sentences that only differ in one expression. It can therefore be seen as a variant of the natural language inference task that is focused on lexical semantics. We formulate and evaluate the first approaches based on pretrained language models (LMs) for this task: (i) a few-shot NLI classifier, (ii) a relation induction approach based on handcrafted patterns expressing the semantics of lexical inference, and (iii) a variant of (ii) with patterns that were automatically extracted from a corpus. All our approaches outperform the previous state of the art, showing the potential of pretrained LMs for LIiC. In an extensive analysis, we investigate factors of success and failure of our three approaches.
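To make the pattern-based idea in approach (ii) concrete, here is a minimal sketch of how a premise/hypothesis pair might be turned into pattern-instantiated sentences and then judged. The two pattern strings, the function names, and the pluggable `score` callable are all assumptions for illustration; they are not taken from the paper or its repository, and in practice `score` would presumably be backed by a pretrained LM rather than a toy function.

```python
from typing import Callable, List

# Hypothetical patterns for illustration only (not the paper's actual patterns).
# A positive pattern expresses that the premise entails the hypothesis;
# a negative pattern expresses that it does not.
POSITIVE_PATTERNS = ["{prem}, which means that {hypo}."]
NEGATIVE_PATTERNS = ["{prem}, but that does not mean that {hypo}."]


def instantiate(patterns: List[str], premise: str, hypothesis: str) -> List[str]:
    """Fill each pattern with the premise and hypothesis (trailing periods removed)."""
    prem = premise.rstrip(". ")
    hypo = hypothesis.rstrip(". ")
    return [p.format(prem=prem, hypo=hypo) for p in patterns]


def predict_entailment(
    premise: str,
    hypothesis: str,
    score: Callable[[str], float],
) -> bool:
    """Predict entailment if the best positive pattern outscores the best negative one.

    `score` is an assumed stand-in for a model that rates how plausible
    a sentence is; any callable mapping a string to a float will do here.
    """
    pos = max(score(s) for s in instantiate(POSITIVE_PATTERNS, premise, hypothesis))
    neg = max(score(s) for s in instantiate(NEGATIVE_PATTERNS, premise, hypothesis))
    return pos > neg
```

For example, `instantiate(POSITIVE_PATTERNS, "John bought the shop.", "John owns the shop.")` yields a single sentence joining the two with "which means that". Keeping the scorer as a parameter separates pattern construction from the model, so the same sketch could be tried with different scoring functions.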

Code & Models

Repositories

mnschmit/lm-lexical-inference (PyTorch, official)
