The Override Gap: A Magnitude Account of Knowledge Conflict Failure in Hypernetwork-Based Instant LLM Adaptation
Shuaizhi Cheng, Xiang Shi, Zhiwei Zhang, Mingwei Li

TL;DR
This paper identifies a magnitude-based failure in hypernetwork-based LLM adaptation during knowledge conflicts and proposes simple, training-free boosting methods to significantly improve accuracy.
Contribution
It introduces a magnitude account for conflict failure and presents two effective, training-free boosting techniques that enhance deep-conflict accuracy in hypernetwork-based LLM adaptation.
Findings
Selective Layer Boosting raises deep-conflict accuracy from 46.4% to 71.0%.
Conflict-Aware Internalization improves accuracy on Gemma-2B and Mistral-7B models.
The proposed methods outperform vanilla retrieval-augmented generation on medium conflicts.
Abstract
Hypernetwork-based methods such as Doc-to-LoRA internalize a document into an LLM's weights in a single forward pass, but they fail systematically on conflicts: when the document contradicts pretraining knowledge, accuracy collapses to 46.4% on the deepest facts. We show the failure is a magnitude problem rather than a representational one. The hypernetwork already targets the right layers, but its adapter margin is approximately constant across documents while the pretrained margin grows with training frequency, so deep conflicts lose by construction. The account predicts that failure should track prior strength: sorting 194 conflicts by the base model's log-probability on the contradicted fact, baseline accuracy falls from 68% on weak-prior questions to 16% on strong-prior ones, a 52 percentage-point gap. The cure is amplitude. Selective Layer Boosting scales the adapter at its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
