Context-Parametric Inversion: Why Instruction Finetuning Can Worsen Context Reliance
Sachin Goyal, Christina Baek, J. Zico Kolter, Aditi Raghunathan

TL;DR
This paper investigates why instruction finetuning can unexpectedly reduce a language model's reliance on input context during knowledge conflicts, revealing a phenomenon called context-parametric inversion across multiple models and datasets.
Contribution
The study uncovers the counterintuitive effect of instruction finetuning decreasing context reliance in knowledge conflicts and analyzes its causes through controlled experiments and theoretical insights.
Findings
Context reliance initially increases then decreases during finetuning.
The phenomenon occurs across multiple datasets and model families.
Mitigation strategies offer limited but valuable improvements.
Abstract
A standard practice when using large language models is for users to supplement their instruction with an input context containing new information for the model to process. However, models struggle to reliably follow the input context, especially when it conflicts with their parametric knowledge from pretraining. In-principle, one would expect models to adapt to the user context better after instruction finetuning, particularly when handling knowledge conflicts. However, we observe a surprising failure mode: during instruction tuning, the context reliance under knowledge conflicts initially increases as expected, but then gradually decreases as instruction finetuning progresses. This happens while the performance on standard benchmarks keeps on increasing far after this drop. We call this phenomenon context-parametric inversion and observe it across multiple general purpose instruction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Speech and Audio Processing · Advanced Adaptive Filtering Techniques
MethodsPythia
