Glitter: Visualizing Lexical Surprisal for Readability in Administrative Texts
Jan \v{C}ern\'y, Ivana Kvapil\'ikov\'a, Silvie Cinkov\'a

TL;DR
This paper introduces Glitter, a visualization tool that estimates text readability by measuring lexical surprisal and information entropy using multiple language models, aiming to improve administrative texts' clarity.
Contribution
The paper presents a novel visualization framework for estimating text readability through lexical surprisal and entropy, specifically targeting bureaucratic language.
Findings
Effective visualization of lexical surprisal for readability analysis
Potential to enhance clarity of administrative texts
Open-source tool available for practical use
Abstract
This work investigates how measuring information entropy of text can be used to estimate its readability. We propose a visualization framework that can be used to approximate information entropy of text using multiple language models and visualize the result. The end goal is to use this method to estimate and improve readability and clarity of administrative or bureaucratic texts. Our toolset is available as a libre software on https://github.com/ufal/Glitter.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Authorship Attribution and Profiling · Data Visualization and Analytics
