Studying Large Language Model Behaviors Under Context-Memory Conflicts   With Real Documents

Evgenii Kortukov; Alexander Rubinstein; Elisa Nguyen; Seong Joon Oh

arXiv:2404.16032·cs.LG·October 10, 2024

Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents

Evgenii Kortukov, Alexander Rubinstein, Elisa Nguyen, Seong Joon Oh

PDF

Open Access 1 Repo

TL;DR

This paper investigates how large language models handle conflicting information from real documents in retrieval-augmented generation, revealing biases and limitations in updating factual knowledge.

Contribution

It introduces a realistic framework for studying knowledge conflicts in LLMs using real documents, highlighting biases and the impact of parametric knowledge on reading behavior.

Findings

01

Knowledge updates fail less often than previously thought.

02

Parametric bias increases likelihood of update failure.

03

Factual parametric knowledge can hinder reading and updating abilities.

Abstract

Retrieval-augmented generation (RAG) mitigates many problems of fully parametric language models, such as temporal degradation, hallucinations, and lack of grounding. In RAG, the model's knowledge can be updated from documents provided in context. This leads to cases of conflict between the model's parametric knowledge and the contextual information, where the model may not always update its knowledge. Previous work studied context-memory knowledge conflicts by creating synthetic documents that contradict the model's correct parametric answers. We present a framework for studying such knowledge conflicts in a realistic setup. We update incorrect parametric knowledge using real conflicting documents. This reflects how knowledge conflicts arise in practice. In this realistic scenario, we find that knowledge updates fail less often than previously reported. In cases where the models still…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kortukov/realistic_knowledge_conflicts
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · Byte Pair Encoding · Linear Layer · Adam · Linear Warmup With Linear Decay · Layer Normalization · Multi-Head Attention · Dropout