Probabilistic Language Tries: A Unified Framework for Compression, Decision Policies, and Execution Reuse
Gregory Magarshak

TL;DR
The paper introduces probabilistic language tries (PLTs), a unified framework for compression, decision policies, and inference reuse, applicable across various domains including search, robotics, and language models.
Contribution
It presents PLTs as a novel structure that explicitly encodes prefix probabilities, enabling optimal compression, policy representation, and efficient inference reuse.
Findings
PLTs generalize arithmetic coding for model-conditioned distributions.
A prior-guided caching theorem shows lower inference costs with PLTs.
A hybrid compression architecture connects to rate-distortion theory.
Abstract
We introduce probabilistic language tries (PLTs), a unified representation that makes explicit the prefix structure implicitly defined by any generative model over sequences. By assigning to each outgoing edge the conditional probability of the corresponding token or action, a PLT simultaneously serves as: (i) an optimal lossless compressor via frequency-weighted interval encoding, generalizing arithmetic coding to model-conditioned distributions; (ii) a policy representation for sequential decision problems including games, search, and robotic control; and (iii) a memoization index that lets repeated inference queries be answered by structured retrieval rather than full model execution. The central technical result is a prior-guided caching theorem: under a stationary generative distribution, a PLT-guided cache achieves strictly lower expected inference cost than any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
