Context-Parametric Inversion: Why Instruction Finetuning Can Worsen Context Reliance

Sachin Goyal; Christina Baek; J. Zico Kolter; Aditi Raghunathan

arXiv:2410.10796 · cs.LG · April 22, 2025

TL;DR

This paper investigates why instruction finetuning can unexpectedly reduce a language model's reliance on input context during knowledge conflicts, revealing a phenomenon called context-parametric inversion across multiple models and datasets.

Contribution

The study shows that instruction finetuning can, counterintuitively, decrease context reliance under knowledge conflicts, and it analyzes the causes through controlled experiments and theoretical insights.

Findings

01. Context reliance under knowledge conflicts initially increases, then decreases as instruction finetuning progresses.

02. The phenomenon occurs across multiple datasets and model families.

03. Mitigation strategies offer limited but valuable improvements.

Abstract

A standard practice when using large language models is for users to supplement their instruction with an input context containing new information for the model to process. However, models struggle to reliably follow the input context, especially when it conflicts with their parametric knowledge from pretraining. In principle, one would expect models to adapt to the user context better after instruction finetuning, particularly when handling knowledge conflicts. However, we observe a surprising failure mode: during instruction tuning, the context reliance under knowledge conflicts initially increases as expected, but then gradually decreases as instruction finetuning progresses. This happens while the performance on standard benchmarks keeps on increasing far after this drop. We call this phenomenon context-parametric inversion and observe it across multiple general purpose instruction…
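
To make the quantity in the abstract concrete, the following is a minimal sketch of how context reliance under knowledge conflicts might be scored across finetuning checkpoints. It is not the authors' evaluation code: the `ConflictExample` record, the substring-matching rule, and the `find_inversion_step` helper are all assumptions introduced here for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class ConflictExample:
    """One knowledge-conflict probe (hypothetical record layout)."""
    context_answer: str     # answer supported by the input context
    parametric_answer: str  # answer the model would give from pretraining alone
    model_output: str       # text the model actually generated


def _mentions(output: str, answer: str) -> bool:
    # Assumed matching rule: case-insensitive substring match.
    return answer.strip().lower() in output.strip().lower()


def context_reliance(examples: List[ConflictExample]) -> float:
    """Fraction of outputs that give the context answer and not the parametric one."""
    if not examples:
        raise ValueError("need at least one example")
    followed = sum(
        1
        for ex in examples
        if _mentions(ex.model_output, ex.context_answer)
        and not _mentions(ex.model_output, ex.parametric_answer)
    )
    return followed / len(examples)


def find_inversion_step(curve: List[Tuple[int, float]]) -> Optional[int]:
    """Given (finetuning step, context reliance) pairs in step order, return the
    step where reliance peaks if the final value is lower than the peak,
    otherwise None (no rise-then-fall pattern)."""
    peak_step, peak_value = max(curve, key=lambda point: point[1])
    final_value = curve[-1][1]
    return peak_step if final_value < peak_value else None


examples = [
    ConflictExample("Paris", "Lyon", "The answer is Paris."),
    ConflictExample("Paris", "Lyon", "Lyon"),
    ConflictExample("1999", "2001", "It was 1999"),
    ConflictExample("blue", "red", "red or blue"),  # ambiguous, not counted
]
reliance = context_reliance(examples)

# Illustrative numbers only, not results from the paper.
toy_curve = [(0, 0.3), (100, 0.6), (200, 0.5), (400, 0.4)]
inversion_step = find_inversion_step(toy_curve)
```

Scoring each saved checkpoint this way and plotting the resulting curve next to a standard benchmark score is one way to see the pattern the abstract describes: reliance peaking early while benchmark performance keeps rising.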

Taxonomy

Topics: Gaze Tracking and Assistive Technology · Speech and Audio Processing · Advanced Adaptive Filtering Techniques

Methods: Pythia