Efficiently Computing Susceptibility to Context in Language Models

Tianyu Liu; Kevin Du; Mrinmaya Sachan; Ryan Cotterell

arXiv:2410.14361·cs.CL·October 21, 2024

Efficiently Computing Susceptibility to Context in Language Models

Tianyu Liu, Kevin Du, Mrinmaya Sachan, Ryan Cotterell

PDF

Open Access

TL;DR

This paper introduces Fisher susceptibility, a fast and efficient method to measure how sensitive language models are to context changes, enabling better analysis of model behavior.

Contribution

We propose Fisher susceptibility, an efficient alternative to Monte Carlo methods for quantifying language model sensitivity to context, validated across diverse domains.

Findings

01

Fisher susceptibility is 70 times faster than Monte Carlo approximation.

02

Larger models are as susceptible as smaller ones.

03

Fisher susceptibility closely matches Monte Carlo estimates.

Abstract

One strength of modern language models is their ability to incorporate information from a user-input context when answering queries. However, they are not equally sensitive to the subtle changes to that context. To quantify this, Du et al. (2024) gives an information-theoretic metric to measure such sensitivity. Their metric, susceptibility, is defined as the degree to which contexts can influence a model's response to a query at a distributional level. However, exactly computing susceptibility is difficult and, thus, Du et al. (2024) falls back on a Monte Carlo approximation. Due to the large number of samples required, the Monte Carlo approximation is inefficient in practice. As a faster alternative, we propose Fisher susceptibility, an efficient method to estimate the susceptibility based on Fisher information. Empirically, we validate that Fisher susceptibility is comparable to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsSparse Evolutionary Training