Measurement of statistical evidence on an absolute scale following thermodynamic principles
V. J. Vieland, J. Das, S. E. Hodge, S.-C. Seok

TL;DR
This paper introduces a thermodynamic framework for measuring statistical evidence on an absolute scale, addressing limitations of traditional statistics like p-values and enabling consistent comparison across studies.
Contribution
It proposes a novel thermodynamic-based method to quantify statistical evidence on an absolute scale, improving calibration and interpretability of evidence measures.
Findings
Developed a thermodynamic model for evidence measurement
Demonstrated consistent evidence calibration across studies
Connected statistical evidence with physical thermodynamic principles
Abstract
Statistical analysis is used throughout biomedical research and elsewhere to assess strength of evidence. We have previously argued that typical outcome statistics (including p-values and maximum likelihood ratios) have poor measure-theoretic properties: they can erroneously indicate decreasing evidence as data supporting an hypothesis accumulate; and they are not amenable to calibration, necessary for meaningful comparison of evidence across different study designs, data types, and levels of analysis. We have also previously proposed that thermodynamic theory, which allowed for the first time derivation of an absolute measurement scale for temperature (T), could be used to derive an absolute scale for evidence (E). Here we present a novel thermodynamically-based framework in which measurement of E on an absolute scale, for which "one degree" always means the same thing, becomes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
