Tug-of-War Between Knowledge: Exploring and Resolving Knowledge Conflicts in Retrieval-Augmented Language Models

Zhuoran Jin; Pengfei Cao; Yubo Chen; Kang Liu; Xiaojian Jiang; Jiexin Xu; Qiuxia Li; Jun Zhao

arXiv:2402.14409·cs.CL·February 23, 2024·3 cites

Open Access

TL;DR

This paper investigates how retrieval-augmented language models handle conflicting knowledge from internal memory and external sources, revealing biases and proposing a method to resolve these conflicts effectively.

Contribution

It introduces an evaluation framework for knowledge conflicts in RALMs, analyzes their behavior and biases, and proposes the Conflict-Disentangle Contrastive Decoding (CD2) method to resolve conflicts.

Findings

01. Stronger RALMs persistently favor their faulty internal memory even when correct external evidence is provided

02. RALMs exhibit availability and confirmation biases

03. CD2 effectively resolves knowledge conflicts

Abstract

Retrieval-augmented language models (RALMs) have demonstrated significant potential in refining and expanding their internal memory by retrieving evidence from external sources. However, RALMs will inevitably encounter knowledge conflicts when integrating their internal memory with external sources. Knowledge conflicts can ensnare RALMs in a tug-of-war between knowledge, limiting their practical applicability. In this paper, we focus on exploring and resolving knowledge conflicts in RALMs. First, we present an evaluation framework for assessing knowledge conflicts across various dimensions. Then, we investigate the behavior and preference of RALMs from the following two perspectives: (1) Conflicts between internal memory and external sources: We find that stronger RALMs emerge with the Dunning-Kruger effect, persistently favoring their faulty internal memory even when correct evidence…

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques
