Entropy-Based Decoding for Retrieval-Augmented Large Language Models

Zexuan Qiu; Zijing Ou; Bin Wu; Jingjing Li; Aiwei Liu; Irwin King

arXiv:2406.17519 · cs.CL · February 18, 2025 · 1 citation

TL;DR

This paper introduces an entropy-guided decoding method for retrieval-augmented LLMs that improves factual accuracy by reducing distractibility from noisy external and internal knowledge sources.

Contribution

It presents a novel, training-free decoding approach combining entropy-based ensemble and contrastive decoding to enhance relevant information extraction in retrieval-augmented LLMs.

Findings

01 Outperforms existing methods on open-domain question answering datasets
02 Reduces distractibility caused by noisy knowledge sources
03 Improves factual accuracy of generated responses

Abstract

Augmenting Large Language Models (LLMs) with retrieved external knowledge has proven effective for improving the factual accuracy of generated responses. Despite their success, retrieval-augmented LLMs still face the distractibility issue, where the generated responses are negatively influenced by noise from both external and internal knowledge sources. In this paper, we introduce a novel, training-free decoding method guided by entropy considerations to mitigate this issue. Our approach utilizes entropy-based document-parallel ensemble decoding to prioritize low-entropy distributions from retrieved documents, thereby enhancing the extraction of relevant information from the context. Additionally, it incorporates a contrastive decoding mechanism that contrasts the obtained low-entropy ensemble distribution with the high-entropy distribution derived from the model's internal knowledge across…
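The two ideas named in the abstract, an entropy-weighted ensemble over per-document distributions and a contrast against the model's internal distribution, can be illustrated with a toy sketch. This is not the paper's implementation. The softmax-over-negative-entropy weighting, the single closed-book `internal` distribution, and the `temperature` and `beta` parameters are all assumptions made for illustration, and the abstract is truncated before it says how the internal distribution is actually obtained.

```python
import math

def entropy(p):
    """Shannon entropy (in nats) of a probability vector."""
    return -sum(x * math.log(x) for x in p if x > 0.0)

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def entropy_weighted_ensemble(doc_dists, temperature=1.0):
    """Mix per-document next-token distributions, favouring low entropy.

    Assumption: weights are a softmax over negative entropies; the
    paper's exact weighting rule is not given in the abstract.
    """
    weights = softmax([-entropy(p) / temperature for p in doc_dists])
    vocab = len(doc_dists[0])
    mixed = [sum(w * p[i] for w, p in zip(weights, doc_dists))
             for i in range(vocab)]
    return mixed, weights

def contrastive_scores(ensemble, internal, beta=0.5, eps=1e-12):
    """Score each token as log p_ensemble - beta * log p_internal.

    Assumption: `internal` is one closed-book distribution and `beta`
    is a hypothetical contrast strength.
    """
    return [math.log(pe + eps) - beta * math.log(pi + eps)
            for pe, pi in zip(ensemble, internal)]

# Toy vocabulary of three tokens, one distribution per retrieved document.
doc_dists = [
    [0.90, 0.05, 0.05],  # confident (low-entropy) document
    [0.34, 0.33, 0.33],  # distracting, near-uniform document
]
internal = [0.2, 0.6, 0.2]  # hypothetical closed-book prediction

mixed, weights = entropy_weighted_ensemble(doc_dists)
scores = contrastive_scores(mixed, internal)
best_token = max(range(len(scores)), key=scores.__getitem__)
```

In this toy case the confident document receives the larger ensemble weight, and the contrast step further penalizes the token that only the closed-book distribution prefers, so token 0 is selected.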

Code & Models

Repositories

codelion/optillm/blob/main/optillm/entropy_decoding.py (PyTorch)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques