LAWS: Learning from Actual Workloads Symbolically -- A Self-Certifying Parametrized Cache Architecture for Neural Inference, Robotics, and Edge Deployment
Gregory Magarshak

TL;DR
LAWS introduces a self-certifying cache architecture that learns from deployment workloads, providing formal error bounds and generalizing existing caching methods for neural inference and robotics.
Contribution
The paper presents LAWS, a novel symbolic caching architecture with formal error bounds, generalizing Mixture-of-Experts and KV caches, and applicable to neural inference and robotic control.
Findings
LAWS guarantees a bounded approximation error at deployment.
The expert library growth rate is O(2^H log N), where H is workload entropy.
LAWS achieves a convergence speedup proportional to the number of fleet units.
Abstract
We introduce LAWS (Learning from Actual Workloads Symbolically), a self-certifying inference caching architecture that builds a growing library of certified expert functions from deployment observations. Each expert covers a region of input space defined by a node in the Probabilistic Language Trie (PLT) of the base model and carries a formal error bound holding uniformly over all inputs. The central result is a self-certification theorem: for any input x, the LAWS approximation error is bounded by epsilon_fit + 2*Lambda(W)*C_E, where Lambda(W) is the model Lipschitz constant, C_E is the maximum embedding diameter, and epsilon_fit is the expert training error -- all checkable at deployment time without ground truth. We prove that LAWS generalizes both Mixture-of-Experts and KV prefix caching as special cases and is strictly more expressive than any fixed-K MoE or finite cache. Further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
