TL;DR
This paper introduces CSKS, a lightweight framework that uses proxy models to continuously and precisely steer large language models' sensitivity to contextual knowledge without modifying their weights.
Contribution
The authors propose a novel method using small proxy models to control LLMs' sensitivity to context, enabling continuous adjustment without retraining or modifying the original model.
Findings
CSKS effectively controls LLMs' sensitivity to contextual knowledge.
The framework can both increase and decrease sensitivity as needed.
Experimental results validate the method's precision and practicality.
Abstract
In Large Language Models (LLMs) generation, there exist knowledge conflicts and scenarios where parametric knowledge contradicts knowledge provided in the context. Previous works studied tuning, decoding algorithms, or locating and editing context-aware neurons to adapt LLMs to be faithful to new contextual knowledge. However, they are usually inefficient or ineffective for large models, not workable for black-box models, or unable to continuously adjust LLMs' sensitivity to the knowledge provided in the context. To mitigate these problems, we propose CSKS (Continuously Steering Knowledge Sensitivity), a simple framework that can steer LLMs' sensitivity to contextual knowledge continuously at a lightweight cost. Specifically, we tune two small LMs (i.e. proxy models) and use the difference in their output distributions to shift the original distribution of an LLM without modifying the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
