Enhancing Noise Robustness of Retrieval-Augmented Language Models with   Adaptive Adversarial Training

Feiteng Fang; Yuelin Bai; Shiwen Ni; Min Yang; Xiaojun Chen and; Ruifeng Xu

arXiv:2405.20978·cs.AI·June 3, 2024·1 cites

Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training

Feiteng Fang, Yuelin Bai, Shiwen Ni, Min Yang, Xiaojun Chen and, Ruifeng Xu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces RAAT, a novel adaptive adversarial training method for retrieval-augmented language models, significantly improving their robustness against various real-world retrieval noises.

Contribution

The study categorizes real-world retrieval noises and proposes RAAT, a dynamic adversarial training approach that enhances LLM robustness and noise recognition capabilities.

Findings

01

RAAT improves F1 and EM scores under diverse noise conditions.

02

Extensive experiments validate RAAT's effectiveness over baseline models.

03

Code and data are publicly released for reproducibility.

Abstract

Large Language Models (LLMs) exhibit substantial capabilities yet encounter challenges, including hallucination, outdated knowledge, and untraceable reasoning processes. Retrieval-augmented generation (RAG) has emerged as a promising solution, integrating knowledge from external databases to mitigate these challenges. However, inappropriate retrieved passages can potentially hinder the LLMs' capacity to generate comprehensive and high-quality responses. Prior RAG studies on the robustness of retrieval noises often confine themselves to a limited set of noise types, deviating from real-world retrieval environments and limiting practical applicability. In this study, we initially investigate retrieval noises and categorize them into three distinct types, reflecting real-world environments. We analyze the impact of these various retrieval noises on the robustness of LLMs. Subsequently, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

calubkk/raat
pytorchOfficial

Videos

Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training· underline

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · WordPiece · Linear Warmup With Linear Decay · Weight Decay · Attention Dropout · Linear Layer · Byte Pair Encoding · Adam