Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models   against Counterfactual Noise

Giwon Hong; Jeonghwan Kim; Junmo Kang; Sung-Hyon Myaeng; Joyce Jiyoung; Whang

arXiv:2305.01579·cs.CL·June 11, 2024·2 cites

Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models against Counterfactual Noise

Giwon Hong, Jeonghwan Kim, Junmo Kang, Sung-Hyon Myaeng, Joyce Jiyoung, Whang

PDF

Open Access 1 Repo

TL;DR

This paper addresses the vulnerability of retrieval-augmented language models to conflicting information within retrieved documents, proposing methods to improve robustness against such noise and introducing a new dataset for further research.

Contribution

It introduces approaches for handling conflicting information in retrieval-augmented models, including fine-tuning discriminators and prompting GPT-3.5, and presents a new dataset to facilitate robustness research.

Findings

01

Significant robustness improvements in open-domain QA tasks.

02

Effective use of discriminators to detect conflicting information.

03

Introduction of MacNoise dataset for conflict-induced data.

Abstract

Most existing retrieval-augmented language models (LMs) assume a naive dichotomy within a retrieved document set: query-relevance and irrelevance. Our work investigates a more challenging scenario in which even the "relevant" documents may contain misleading or incorrect information, causing conflict among the retrieved documents and thereby negatively influencing model decisions as noise. We observe that existing LMs are highly brittle to the presence of conflicting information in both the fine-tuning and in-context few-shot learning scenarios. We propose approaches for handling knowledge conflicts among retrieved documents by explicitly fine-tuning a discriminator or prompting GPT-3.5 to elicit its discriminative capability. Our empirical results on open-domain QA show that these approaches significantly enhance model robustness. We also provide our findings on incorporating the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wjdghks950/discern-and-answer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning

Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Cosine Annealing · Adam · Layer Normalization · Linear Layer · Dropout · Byte Pair Encoding · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia?