What Evidence Do Language Models Find Convincing?

Alexander Wan; Eric Wallace; Dan Klein

arXiv:2402.11782·cs.CL·August 12, 2024·1 cites

What Evidence Do Language Models Find Convincing?

Alexander Wan, Eric Wallace, Dan Klein

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

This paper investigates how retrieval-augmented language models evaluate conflicting evidence, revealing their reliance on relevance over stylistic features, and emphasizes the importance of high-quality RAG data for better alignment with human judgments.

Contribution

The study introduces the ConflictingQA dataset and analyzes LLM sensitivities to different evidence features, highlighting current model biases and suggesting improvements for RAG training.

Findings

01

Models rely heavily on relevance of evidence

02

Stylistic features like scientific references are largely ignored

03

High-quality, filtered RAG data is crucial for better alignment

Abstract

Retrieval-augmented language models are being increasingly tasked with subjective, contentious, and conflicting queries such as "is aspartame linked to cancer". To resolve these ambiguous queries, one must search through a large range of websites and consider "which, if any, of this evidence do I find convincing?". In this work, we study how LLMs answer this question. In particular, we construct ConflictingQA, a dataset that pairs controversial queries with a series of real-world evidence documents that contain different facts (e.g., quantitative results), argument styles (e.g., appeals to authority), and answers (Yes or No). We use this dataset to perform sensitivity and counterfactual analyses to explore which text features most affect LLM predictions. Overall, we find that current models rely heavily on the relevance of a website to the query, while largely ignoring stylistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alexwan0/rag-convincingness
noneOfficial

Datasets

KaiserWhoLearns/conflictqa-u
dataset· 7 dl
7 dl

Videos

What Evidence Do Language Models Find Convincing?· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Linear Warmup With Linear Decay · Dropout · Linear Layer · Weight Decay · Byte Pair Encoding · Attention Dropout · Dense Connections · Adam