LAWS: Learning from Actual Workloads Symbolically -- A Self-Certifying Parametrized Cache Architecture for Neural Inference, Robotics, and Edge Deployment

Gregory Magarshak

arXiv:2605.04069·cs.LG·May 7, 2026

TL;DR

LAWS introduces a self-certifying cache architecture that learns from deployment workloads, providing formal error bounds and generalizing existing caching methods for neural inference and robotics.

Contribution

The paper presents LAWS, a symbolic caching architecture whose experts carry formal error bounds. It generalizes both Mixture-of-Experts and KV prefix caching as special cases and applies to neural inference and robotic control.

Findings

01

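For any input x, LAWS guarantees an approximation error of at most epsilon_fit + 2*Lambda(W)*C_E, a bound that can be checked at deployment time without ground truth.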

02

The expert library growth rate is O(2^H log N), where H is workload entropy.

03

LAWS achieves a convergence speedup proportional to the number of fleet units.
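
To make the growth rate in finding 02 concrete, here is a minimal Python sketch of how the O(2^H log N) term scales. It is an illustration only: the function name `library_growth_term` is hypothetical, the hidden constant factor is ignored, and this summary does not define N, so the code treats it as an unspecified positive size parameter.

```python
import math


def library_growth_term(entropy_bits: float, n: float) -> float:
    """Return 2^H * log(N), the asymptotic term without its constant factor.

    Assumptions: entropy_bits is the workload entropy H measured in bits,
    and n is the size parameter N, whose meaning this summary leaves open.
    The natural log is used; the base only changes the constant factor.
    """
    if n <= 1:
        raise ValueError("n must be greater than 1 for a positive log term")
    return (2.0 ** entropy_bits) * math.log(n)


# Each extra bit of workload entropy doubles the term,
# while multiplying N by a constant only adds to the log factor.
low = library_growth_term(entropy_bits=3.0, n=1_000.0)
high = library_growth_term(entropy_bits=4.0, n=1_000.0)
ratio = high / low
```

Under these assumptions the term is exponential in H but only logarithmic in N, so a low-entropy workload keeps the library small even as N grows.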

Abstract

We introduce LAWS (Learning from Actual Workloads Symbolically), a self-certifying inference caching architecture that builds a growing library of certified expert functions from deployment observations. Each expert covers a region of input space defined by a node in the Probabilistic Language Trie (PLT) of the base model and carries a formal error bound holding uniformly over all inputs. The central result is a self-certification theorem: for any input x, the LAWS approximation error is bounded by epsilon_fit + 2*Lambda(W)*C_E, where Lambda(W) is the model Lipschitz constant, C_E is the maximum embedding diameter, and epsilon_fit is the expert training error -- all checkable at deployment time without ground truth. We prove that LAWS generalizes both Mixture-of-Experts and KV prefix caching as special cases and is strictly more expressive than any fixed-K MoE or finite cache. Further…
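
The self-certification bound in the abstract can be evaluated from three numbers, none of which requires ground truth. The following is a minimal Python sketch of that arithmetic, not the paper's implementation: the class `ExpertCertificate`, its field names, and the `can_serve` tolerance check are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExpertCertificate:
    """Hypothetical container for the quantities named in the abstract."""

    epsilon_fit: float         # expert training error
    lipschitz: float           # Lambda(W), the model Lipschitz constant
    embedding_diameter: float  # C_E, the maximum embedding diameter

    def error_bound(self) -> float:
        # Bound from the abstract: epsilon_fit + 2 * Lambda(W) * C_E
        return self.epsilon_fit + 2.0 * self.lipschitz * self.embedding_diameter


def can_serve(cert: ExpertCertificate, tolerance: float) -> bool:
    """Assumed usage: accept a cached expert only if its bound fits the tolerance."""
    return cert.error_bound() <= tolerance


# Illustrative numbers only; they are not taken from the paper.
cert = ExpertCertificate(epsilon_fit=0.01, lipschitz=2.0, embedding_diameter=0.005)
bound = cert.error_bound()
accepted = can_serve(cert, tolerance=0.05)
```

Because the bound depends only on stored quantities, a check like `can_serve` could run at deployment time before an expert's output is returned. How LAWS actually uses the bound is not stated in this summary.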
