Enhancing Self-Consistency and Performance of Pre-Trained Language Models through Natural Language Inference
Eric Mitchell, Joseph J. Noh, Siyan Li, William S. Armstrong, Ananth, Agarwal, Patrick Liu, Chelsea Finn, Christopher D. Manning

TL;DR
This paper introduces ConCoRD, a framework that enhances the logical consistency and accuracy of pre-trained NLP models by leveraging pre-trained NLI models without additional training.
Contribution
ConCoRD is a novel, training-free method that improves model consistency and accuracy by integrating NLI-based reasoning through factor graphs and MaxSAT solving.
Findings
Boosts accuracy of QA and VQA models
Increases LXMERT's accuracy on ConVQA by 5%
Enhances model consistency without fine-tuning
Abstract
While large pre-trained language models are powerful, their predictions often lack logical consistency across test inputs. For example, a state-of-the-art Macaw question-answering (QA) model answers 'Yes' to 'Is a sparrow a bird?' and 'Does a bird have feet?' but answers 'No' to 'Does a sparrow have feet?'. To address this failure mode, we propose a framework, Consistency Correction through Relation Detection, or ConCoRD, for boosting the consistency and accuracy of pre-trained NLP models using pre-trained natural language inference (NLI) models without fine-tuning or re-training. Given a batch of test inputs, ConCoRD samples several candidate outputs for each input and instantiates a factor graph that accounts for both the model's belief about the likelihood of each answer choice in isolation and the NLI model's beliefs about pair-wise answer choice compatibility. We show that a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Dense Connections · Residual Connection · Byte Pair Encoding · Attention Dropout · Gated Linear Unit · SentencePiece
