The Derivation Penalty in Premise-Erasure Caching: Capacity, Strong Converse, and Dispersion Dichotomy
Jianfeng Xu

TL;DR
This paper develops an information-theoretic framework for premise-erasure caching in reasoning engines, revealing fundamental limits, phase transitions, and the derivation penalty, which quantifies cache efficiency under erasure constraints.
Contribution
It introduces a novel capacity and dispersion analysis for derivation-based caching, establishing the derivation penalty and structural rigidity theorems across different architectures.
Findings
Exponential capacity separation between linear-chain and balanced-merge architectures.
Identification of a critical access frequency for optimal caching.
Derivation penalty converges to the reciprocal of the erasure rate universally.
Abstract
We introduce an information-theoretic framework for caching in derivation-based reasoning engines under independent premise erasure. Two decoder models are compared: a coded scheme using an arbitrary bit-string cache with a general-purpose decoder, and a derivation-constrained scheme where the cache consists of logical facts and the decoder must produce a valid proof. Four coding theorems are established. The first proves that each derivation step carries a universal per-step information content determined by the base size. The second reveals an exponential capacity separation between linear-chain and balanced-merge Datalog architectures at equal depth. The third identifies a critical access frequency separating the regimes where caching and on-demand derivation are optimal. The fourth determines the minimum derivation-constrained cache under erasure, decomposing query information into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Advanced Database Systems and Queries · Distributed systems and fault tolerance
