Enhancing Self-Consistency and Performance of Pre-Trained Language   Models through Natural Language Inference

Eric Mitchell; Joseph J. Noh; Siyan Li; William S. Armstrong; Ananth; Agarwal; Patrick Liu; Chelsea Finn; Christopher D. Manning

arXiv:2211.11875·cs.CL·November 23, 2022·1 cites

Enhancing Self-Consistency and Performance of Pre-Trained Language Models through Natural Language Inference

Eric Mitchell, Joseph J. Noh, Siyan Li, William S. Armstrong, Ananth, Agarwal, Patrick Liu, Chelsea Finn, Christopher D. Manning

PDF

Open Access

TL;DR

This paper introduces ConCoRD, a framework that enhances the logical consistency and accuracy of pre-trained NLP models by leveraging pre-trained NLI models without additional training.

Contribution

ConCoRD is a novel, training-free method that improves model consistency and accuracy by integrating NLI-based reasoning through factor graphs and MaxSAT solving.

Findings

01

Boosts accuracy of QA and VQA models

02

Increases LXMERT's accuracy on ConVQA by 5%

03

Enhances model consistency without fine-tuning

Abstract

While large pre-trained language models are powerful, their predictions often lack logical consistency across test inputs. For example, a state-of-the-art Macaw question-answering (QA) model answers 'Yes' to 'Is a sparrow a bird?' and 'Does a bird have feet?' but answers 'No' to 'Does a sparrow have feet?'. To address this failure mode, we propose a framework, Consistency Correction through Relation Detection, or ConCoRD, for boosting the consistency and accuracy of pre-trained NLP models using pre-trained natural language inference (NLI) models without fine-tuning or re-training. Given a batch of test inputs, ConCoRD samples several candidate outputs for each input and instantiates a factor graph that accounts for both the model's belief about the likelihood of each answer choice in isolation and the NLI model's beliefs about pair-wise answer choice compatibility. We show that a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Dense Connections · Residual Connection · Byte Pair Encoding · Attention Dropout · Gated Linear Unit · SentencePiece