Telegraph English: Semantic Prompt Compression via Structured Symbolic Rewriting

Mikhail L. Arbuzov; Sisong Bei; Ziwei Dong; Dmitri Kalaev; Alexey A. Shvets

arXiv:2605.04426·cs.CL·May 7, 2026

Telegraph English: Semantic Prompt Compression via Structured Symbolic Rewriting

Mikhail L. Arbuzov, Sisong Bei, Ziwei Dong, Dmitri Kalaev, Alexey A. Shvets

PDF

TL;DR

Telegraph English (TE) is a prompt compression method that rewrites natural language into a structured, symbolic format, enabling effective semantic compression and indexing with minimal information loss across various models.

Contribution

TE introduces a full semantic rewrite protocol that decomposes input into atomic facts and substitutes verbose phrases with logical symbols, improving compression and semantic clarity.

Findings

01

TE achieves 50% token reduction while preserving 99.1% of key facts with GPT-4.1.

02

TE outperforms LLMLingua-2 at matched compression ratios across models and tasks.

03

Explicit relational structure in TE helps smaller models retain detailed information.

Abstract

We introduce Telegraph English (TE), a prompt-compression protocol that rewrites natural language into a symbol-rich, formally-structured dialect. Where token-deletion methods such as LLMLingua-2 train a classifier to delete low-importance tokens at a fixed ratio, TE performs a full semantic rewrite: it decomposes the input into atomic fact lines, substitutes verbose phrases with $\sim$ 40 logical and relational symbols, and lets the compression ratio adapt to each document's information density. A consequence of the line-structure rule is that compression and semantic chunking become the same operation -- each output line is an independently addressable fact, so the compressed representation is simultaneously a semantic index. We evaluate TE on 4{,}081 question-answer pairs from LongBench-v2 across five OpenAI models and two difficulty levels. At roughly 50\% token reduction, TE…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.