Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training
Feiteng Fang, Yuelin Bai, Shiwen Ni, Min Yang, Xiaojun Chen and, Ruifeng Xu

TL;DR
This paper introduces RAAT, a novel adaptive adversarial training method for retrieval-augmented language models, significantly improving their robustness against various real-world retrieval noises.
Contribution
The study categorizes real-world retrieval noises and proposes RAAT, a dynamic adversarial training approach that enhances LLM robustness and noise recognition capabilities.
Findings
RAAT improves F1 and EM scores under diverse noise conditions.
Extensive experiments validate RAAT's effectiveness over baseline models.
Code and data are publicly released for reproducibility.
Abstract
Large Language Models (LLMs) exhibit substantial capabilities yet encounter challenges, including hallucination, outdated knowledge, and untraceable reasoning processes. Retrieval-augmented generation (RAG) has emerged as a promising solution, integrating knowledge from external databases to mitigate these challenges. However, inappropriate retrieved passages can potentially hinder the LLMs' capacity to generate comprehensive and high-quality responses. Prior RAG studies on the robustness of retrieval noises often confine themselves to a limited set of noise types, deviating from real-world retrieval environments and limiting practical applicability. In this study, we initially investigate retrieval noises and categorize them into three distinct types, reflecting real-world environments. We analyze the impact of these various retrieval noises on the robustness of LLMs. Subsequently, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · WordPiece · Linear Warmup With Linear Decay · Weight Decay · Attention Dropout · Linear Layer · Byte Pair Encoding · Adam
