Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

Zheng Zhao; Emilio Monti; Jens Lehmann; Haytham Assem

arXiv:2405.02750 · cs.CL · May 7, 2024

TL;DR

This paper introduces a contrastive decoding method with adversarial negative samples to improve the contextual grounding of large language models during text generation, especially in open-domain question answering, without additional training.

Contribution

The paper presents a novel inference-time technique that combines contrastive decoding with adversarial negatives to improve context integration in LLMs, and it is reported to outperform existing methods.

Findings

01 Improved factual consistency in generated text.

02 Enhanced contextual understanding demonstrated through experiments.

03 Method operates without additional training.

Abstract

Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or contextually unfaithful content. LLMs utilize two primary knowledge sources: 1) prior (parametric) knowledge from pretraining, and 2) contextual (non-parametric) knowledge from input prompts. The study addresses the open question of how LLMs effectively balance these knowledge sources during the generation process, specifically in the context of open-domain question answering. To address this issue, we introduce a novel approach integrating contrastive decoding with adversarial irrelevant passages as negative samples to enhance robust context grounding during generation. Notably, our method operates at inference time without requiring further training.…
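The abstract names the idea but this page does not give the scoring rule, so the following is only a minimal sketch of how contrastive decoding against an irrelevant passage can work at a single decoding step. It assumes a commonly used form, `(1 + alpha) * log p(token | relevant context) - alpha * log p(token | irrelevant context)`, which may differ from the paper's exact formulation. The function names, the `alpha` parameter, and the toy logits are all hypothetical; in practice the two logit vectors would come from two forward passes of the same model.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def contrastive_next_token(logits_relevant, logits_irrelevant, alpha=0.5):
    """Pick the next token by contrasting two conditional distributions.

    logits_relevant:   next-token logits given the relevant passage.
    logits_irrelevant: next-token logits given an irrelevant (negative) passage.
    alpha:             hypothetical contrast weight; alpha = 0 is plain greedy decoding.

    Assumed scoring rule (not confirmed by the source page):
        score = (1 + alpha) * log p_relevant - alpha * log p_irrelevant
    """
    lp_rel = log_softmax(logits_relevant)
    lp_irr = log_softmax(logits_irrelevant)
    scores = [(1 + alpha) * r - alpha * i for r, i in zip(lp_rel, lp_irr)]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Toy example: the model's prior favors "London" under both contexts,
# while only the relevant passage raises the score of "Paris".
vocab = ["Paris", "London", "Rome"]
logits_relevant = [2.0, 2.2, 0.1]
logits_irrelevant = [0.5, 2.5, 0.1]

plain_choice = vocab[max(range(len(vocab)), key=logits_relevant.__getitem__)]
contrastive_choice = vocab[
    contrastive_next_token(logits_relevant, logits_irrelevant, alpha=0.5)[0]
]
```

In this toy setup, tokens that score well regardless of the passage (here "London") are penalized, so the token that the relevant passage specifically supports wins. No weights are updated, which matches the abstract's point that the method works at inference time.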


Code & Models

Repositories

amazon-science/contextualunderstanding-contrastivedecoding
PyTorch · Official

Videos

Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding · Underline

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques