Glitter: Visualizing Lexical Surprisal for Readability in Administrative Texts

Jan \v{C}ern\'y; Ivana Kvapil\'ikov\'a; Silvie Cinkov\'a

arXiv:2601.05411·cs.CL·January 12, 2026

Glitter: Visualizing Lexical Surprisal for Readability in Administrative Texts

Jan \v{C}ern\'y, Ivana Kvapil\'ikov\'a, Silvie Cinkov\'a

PDF

Open Access

TL;DR

This paper introduces Glitter, a visualization tool that estimates text readability by measuring lexical surprisal and information entropy using multiple language models, aiming to improve administrative texts' clarity.

Contribution

The paper presents a novel visualization framework for estimating text readability through lexical surprisal and entropy, specifically targeting bureaucratic language.

Findings

01

Effective visualization of lexical surprisal for readability analysis

02

Potential to enhance clarity of administrative texts

03

Open-source tool available for practical use

Abstract

This work investigates how measuring information entropy of text can be used to estimate its readability. We propose a visualization framework that can be used to approximate information entropy of text using multiple language models and visualize the result. The end goal is to use this method to estimate and improve readability and clarity of administrative or bureaucratic texts. Our toolset is available as a libre software on https://github.com/ufal/Glitter.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Authorship Attribution and Profiling · Data Visualization and Analytics