p-Values for Model Evaluation

Frederik Beaujean; Allen Caldwell; Daniel Kollar; Kevin Kroeninger

arXiv:1011.1674·physics.data-an·May 29, 2013

TL;DR

This paper discusses the use and interpretation of p-values in model evaluation, clarifying their practical importance and providing insights into their calculation and application in data analysis.

Contribution

It offers a Bayesian perspective on p-values, explains various discrepancy variables, and evaluates their effectiveness in goodness-of-fit testing.

Findings

1. P-values are practically useful despite interpretational confusion.

2. Discrepancy variables can be used to compute p-values for model assessment.

3. Examples demonstrate the application of p-values in typical data analysis scenarios.

Abstract

Deciding whether a model provides a good description of data is often based on a goodness-of-fit criterion summarized by a p-value. Although there is considerable confusion concerning the meaning of p-values, leading to their misuse, they are nevertheless of practical importance in common data analysis tasks. We motivate their application using a Bayesian argument. We then describe commonly and less commonly known discrepancy variables and how they are used to define p-values. The distributions of these p-values are then extracted for examples modeled on typical data analysis tasks, and comments on their usefulness for determining goodness-of-fit are given.
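To make the link between a discrepancy variable and a p-value concrete, here is a minimal sketch in Python. It is not taken from the paper. It assumes binned data with independent Poisson fluctuations, uses Pearson's chi-square as the discrepancy variable, and estimates the p-value by simulating replicate data sets under a fully specified model (no fitted parameters). The function names `pearson_chi2` and `mc_p_value`, and the bin contents, are hypothetical and chosen only for illustration.

```python
import numpy as np


def pearson_chi2(counts, expected):
    """Pearson chi-square discrepancy between counts and model expectations."""
    return float(np.sum((counts - expected) ** 2 / expected))


def mc_p_value(observed, expected, n_rep=20000, seed=0):
    """Estimate the p-value as the fraction of simulated data sets whose
    discrepancy is at least as large as the observed one."""
    rng = np.random.default_rng(seed)
    t_obs = pearson_chi2(observed, expected)
    # Replicate data sets drawn under the model: independent Poisson bins
    # (an assumption of this sketch).
    replicas = rng.poisson(expected, size=(n_rep, len(expected)))
    t_rep = np.sum((replicas - expected) ** 2 / expected, axis=1)
    return float(np.mean(t_rep >= t_obs))


# Hypothetical model prediction and observed counts per bin.
expected = np.array([12.0, 18.0, 25.0, 18.0, 12.0])
observed = np.array([10, 21, 24, 15, 14])

p = mc_p_value(observed, expected)
print(f"chi2 = {pearson_chi2(observed, expected):.3f}, p-value = {p:.3f}")
```

A small p-value means the observed discrepancy is rarely matched by data generated from the model itself. Swapping `pearson_chi2` for another discrepancy variable changes only the statistic, not the simulation logic.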
