Semantic Compression of LLM Instructions via Symbolic Metalanguages
Ernst van Gassen

TL;DR
This paper presents MetaGlyph, a symbolic language for compressing LLM instructions, reducing token usage significantly and improving interpretability without explicit decoding rules, with varied results across models.
Contribution
MetaGlyph introduces a novel symbolic metalanguage for prompt compression that models can interpret directly, enhancing efficiency and interpretability in LLM instruction following.
Findings
Achieves 62-81% token reduction across tasks
High fidelity in symbolic instruction interpretation for certain models
Open-source models show potential with scale to improve fidelity
Abstract
We introduce MetaGlyph, a symbolic language for compressing prompts by encoding instructions as mathematical symbols rather than prose. Unlike systems requiring explicit decoding rules, MetaGlyph uses symbols like (membership) and (implication) that models already understand from their training data. We test whether these symbols work as ''instruction shortcuts'' that models can interpret without additional teaching. We evaluate eight models across two dimensions relevant to practitioners: scale (3B-1T parameters) and accessibility (open-source for local deployment vs. proprietary APIs). MetaGlyph achieves 62-81% token reduction across all task types. For API-based deployments, this translates directly to cost savings; for local deployments, it reduces latency and memory pressure. Results vary by model. Gemini 2.5 Flash achieves 75% semantic equivalence between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematics, Computing, and Information Processing · Logic, programming, and type systems · Teaching and Learning Programming
