The Value of Mechanistic Priors in Sequential Decision Making
Itai Shufaro, Gal Benor, Shie Mannor

TL;DR
This paper quantifies the value of mechanistic priors in sequential decision-making, showing they reduce data needs and improve efficiency, especially in early decision regimes, with applications in pharmacokinetics.
Contribution
It introduces the concept of mechanistic information and provides theoretical bounds for sample efficiency gains in decision models using physical priors.
Findings
Bayesian regret scales with residual entropy in asymptotic regimes.
Hybrid priors significantly improve sample efficiency in burn-in regimes.
LLMs can lose mechanistic information, highlighting the importance of physical priors.
Abstract
Hybrid mechanistic models, physical priors with learned residuals, promise to reduce the data required for good decisions, but have no computable criterion to test this. We characterize the value of mechanistic priors in sequential decision-making within both asymptotic and burn-in regimes. To formalize this, we introduce the mechanistic information of a model -- the mutual information between the model's recommended policy and the true optimal policy -- quantified via an occupancy-weighted bias . In the asymptotic regime (large ), matched bounds reveal that Bayesian regret scales with the residual entropy , delivering a theoretical sample complexity reduction of compared to an uninformed baseline. Furthermore, we provide a model certificate to determine empirical sample efficiency. Complementarily, in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
