Behavioral Analysis of Information Salience in Large Language Models

Jan Trienes; J\"org Schl\"otterer; Junyi Jessy Li; Christin Seifert

arXiv:2502.14613·cs.CL·May 28, 2025

Behavioral Analysis of Information Salience in Large Language Models

Jan Trienes, J\"org Schl\"otterer, Junyi Jessy Li, Christin Seifert

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates how large language models determine what information is most important during summarization, revealing a hierarchical but internally inaccessible notion of salience that only weakly aligns with human judgments.

Contribution

It introduces an explainable framework to analyze information salience in LLMs through their summarization behavior, providing new insights into their content prioritization mechanisms.

Findings

01

LLMs exhibit a hierarchical and consistent notion of salience.

02

Salience patterns are similar across different models and sizes.

03

Internal salience does not strongly correlate with human perceptions.

Abstract

Large Language Models (LLMs) excel at text summarization, a task that requires models to select content based on its importance. However, the exact notion of salience that LLMs have internalized remains unclear. To bridge this gap, we introduce an explainable framework to systematically derive and investigate information salience in LLMs through their summarization behavior. Using length-controlled summarization as a behavioral probe into the content selection process, and tracing the answerability of Questions Under Discussion throughout, we derive a proxy for how models prioritize information. Our experiments on 13 models across four datasets reveal that LLMs have a nuanced, hierarchical notion of salience, generally consistent across model families and sizes. While models show highly consistent behavior and hence salience patterns, this notion of salience cannot be accessed through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jantrienes/llm-salience
noneOfficial

Videos

Behavioral Analysis of Information Salience in Large Language Models· underline

Taxonomy

TopicsTopic Modeling