Probabilistic Language Tries: A Unified Framework for Compression, Decision Policies, and Execution Reuse

Gregory Magarshak

arXiv:2604.06228·cs.LG·April 9, 2026

Probabilistic Language Tries: A Unified Framework for Compression, Decision Policies, and Execution Reuse

Gregory Magarshak

PDF

TL;DR

The paper introduces probabilistic language tries (PLTs), a unified framework for compression, decision policies, and inference reuse, applicable across various domains including search, robotics, and language models.

Contribution

It presents PLTs as a novel structure that explicitly encodes prefix probabilities, enabling optimal compression, policy representation, and efficient inference reuse.

Findings

01

PLTs generalize arithmetic coding for model-conditioned distributions.

02

A prior-guided caching theorem shows lower inference costs with PLTs.

03

A hybrid compression architecture connects to rate-distortion theory.

Abstract

We introduce probabilistic language tries (PLTs), a unified representation that makes explicit the prefix structure implicitly defined by any generative model over sequences. By assigning to each outgoing edge the conditional probability of the corresponding token or action, a PLT simultaneously serves as: (i) an optimal lossless compressor via frequency-weighted interval encoding, generalizing arithmetic coding to model-conditioned distributions; (ii) a policy representation for sequential decision problems including games, search, and robotic control; and (iii) a memoization index that lets repeated inference queries be answered by structured retrieval rather than full model execution. The central technical result is a prior-guided caching theorem: under a stationary generative distribution, a PLT-guided cache achieves strictly lower expected inference cost than any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.