Semantic Compression of LLM Instructions via Symbolic Metalanguages

Ernst van Gassen

arXiv:2601.07354·cs.CL·January 13, 2026

Semantic Compression of LLM Instructions via Symbolic Metalanguages

Ernst van Gassen

PDF

Open Access

TL;DR

This paper presents MetaGlyph, a symbolic language for compressing LLM instructions, reducing token usage significantly and improving interpretability without explicit decoding rules, with varied results across models.

Contribution

MetaGlyph introduces a novel symbolic metalanguage for prompt compression that models can interpret directly, enhancing efficiency and interpretability in LLM instruction following.

Findings

01

Achieves 62-81% token reduction across tasks

02

High fidelity in symbolic instruction interpretation for certain models

03

Open-source models show potential with scale to improve fidelity

Abstract

We introduce MetaGlyph, a symbolic language for compressing prompts by encoding instructions as mathematical symbols rather than prose. Unlike systems requiring explicit decoding rules, MetaGlyph uses symbols like $\in$ (membership) and $\Rightarrow$ (implication) that models already understand from their training data. We test whether these symbols work as ''instruction shortcuts'' that models can interpret without additional teaching. We evaluate eight models across two dimensions relevant to practitioners: scale (3B-1T parameters) and accessibility (open-source for local deployment vs. proprietary APIs). MetaGlyph achieves 62-81% token reduction across all task types. For API-based deployments, this translates directly to cost savings; for local deployments, it reduces latency and memory pressure. Results vary by model. Gemini 2.5 Flash achieves 75% semantic equivalence between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics, Computing, and Information Processing · Logic, programming, and type systems · Teaching and Learning Programming