Information Efficiency of Scientific Automation

Mihir Rao

arXiv:2511.15671·cs.IT·November 20, 2025

Information Efficiency of Scientific Automation

Mihir Rao

PDF

Open Access

TL;DR

This paper models scientific discovery as a thermodynamic process, deriving bounds on information gain under finite work budgets and proposing an efficiency metric to compare learning strategies in scientific automation.

Contribution

It introduces a thermodynamic framework for understanding scientific automation, deriving bounds on information gain and proposing an efficiency metric for different learning strategies.

Findings

01

Finite-work bounds on information gain in Bayesian learning.

02

Federated learning can be more efficient than unpartitioned strategies.

03

Guidance on optimizing scientific automation efforts.

Abstract

Scientific discovery can be framed as a thermodynamic process in which an agent invests physical work to acquire information about an environment under a finite work budget. Using established results about the thermodynamics of computing, we derive finite-budget bounds on information gain over rounds of sequential Bayesian learning. We also propose a metric of information-work efficiency, and compare unpartitioned and federated learning strategies under matched work budgets. The presented results offer guidance in the form of bounds and an information efficiency metric for efforts in scientific automation at large.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Thermodynamics and Statistical Mechanics · Statistical Mechanics and Entropy · Machine Learning and Algorithms