Towards Context-Robust LLMs: A Gated Representation Fine-tuning Approach

Shenglai Zeng; Pengfei He; Kai Guo; Tianqi Zheng; Hanqing Lu; Yue; Xing; Hui Liu

arXiv:2502.14100·cs.CL·February 25, 2025

Towards Context-Robust LLMs: A Gated Representation Fine-tuning Approach

Shenglai Zeng, Pengfei He, Kai Guo, Tianqi Zheng, Hanqing Lu, Yue, Xing, Hui Liu

PDF

Open Access 1 Video

TL;DR

This paper introduces Grft, a lightweight fine-tuning method that enhances large language models to better balance internal knowledge and external context, improving robustness against misleading information.

Contribution

The paper presents a novel gated representation fine-tuning approach that enables LLMs to selectively rely on external context, addressing over-reliance and contradiction issues.

Findings

01

Grft effectively improves context-robustness in LLMs.

02

Requires minimal additional parameters and data for fine-tuning.

03

Enhances LLMs' ability to handle imperfect external evidence.

Abstract

Large Language Models (LLMs) enhanced with external contexts, such as through retrieval-augmented generation (RAG), often face challenges in handling imperfect evidence. They tend to over-rely on external knowledge, making them vulnerable to misleading and unhelpful contexts. To address this, we propose the concept of context-robust LLMs, which can effectively balance internal knowledge with external context, similar to human cognitive processes. Specifically, context-robust LLMs should rely on external context only when lacking internal knowledge, identify contradictions between internal and external knowledge, and disregard unhelpful contexts. To achieve this goal, we introduce Grft, a lightweight and plug-and-play gated representation fine-tuning approach. Grft consists of two key components: a gating mechanism to detect and filter problematic inputs, and low-rank representation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards Context-Robust LLMs: A Gated Representation Fine-tuning Approach· underline

Taxonomy

TopicsNatural Language Processing Techniques