Efficiently Computing Susceptibility to Context in Language Models
Tianyu Liu, Kevin Du, Mrinmaya Sachan, Ryan Cotterell

TL;DR
This paper introduces Fisher susceptibility, a fast and efficient method to measure how sensitive language models are to context changes, enabling better analysis of model behavior.
Contribution
We propose Fisher susceptibility, an efficient alternative to Monte Carlo methods for quantifying language model sensitivity to context, validated across diverse domains.
Findings
Fisher susceptibility is 70 times faster than Monte Carlo approximation.
Larger models are as susceptible as smaller ones.
Fisher susceptibility closely matches Monte Carlo estimates.
Abstract
One strength of modern language models is their ability to incorporate information from a user-input context when answering queries. However, they are not equally sensitive to the subtle changes to that context. To quantify this, Du et al. (2024) gives an information-theoretic metric to measure such sensitivity. Their metric, susceptibility, is defined as the degree to which contexts can influence a model's response to a query at a distributional level. However, exactly computing susceptibility is difficult and, thus, Du et al. (2024) falls back on a Monte Carlo approximation. Due to the large number of samples required, the Monte Carlo approximation is inefficient in practice. As a faster alternative, we propose Fisher susceptibility, an efficient method to estimate the susceptibility based on Fisher information. Empirically, we validate that Fisher susceptibility is comparable to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsSparse Evolutionary Training
