Model Assessment Tools for a Model False World

Bruce Lindsay; Jiawei Liu

arXiv:1010.0304·stat.ME·October 5, 2010

TL;DR

This paper introduces a model credibility index to evaluate how well a model approximates the true data-generating process, acknowledging that most models are inherently false but still useful.

Contribution

It proposes a credibility index defined as the maximum sample size at which samples from the model and samples from the true data-generating mechanism are nearly indistinguishable, giving a one-number summary of model adequacy.
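One way to write this definition down is sketched here; it is an illustration under stated assumptions, not necessarily the paper's exact formulation. Suppose $\pi_n$ is the probability that a level-$\alpha$ goodness-of-fit test rejects the model when given $n$ observations from the true data-generating mechanism. The symbols $\pi_n$, $\alpha$, $N^*$ and the cutoff $1/2$ for "nearly indistinguishable" are assumptions introduced for this sketch.

```latex
% Assumed notation: \pi_n = rejection probability of a level-\alpha test
% at sample size n; the cutoff 1/2 is an illustrative choice.
N^{*} \;=\; \max\bigl\{\, n \ge 1 \;:\; \pi_n \le \tfrac{1}{2} \,\bigr\}
```

Under this reading, a larger $N^*$ means more data are needed before the model's flaws become detectable, so the model is a closer approximation to the truth.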

Findings

01

The credibility index can be estimated using data subsampling.

02

The definition leads to new ways of viewing false models as flawed but useful.

03

The definition is made precise using standard notions from hypothesis testing.

Abstract

A standard goal of model evaluation and selection is to find a model that approximates the truth well while at the same time being as parsimonious as possible. In this paper we emphasize the point of view that the models under consideration are almost always false, if viewed realistically, and so we should analyze model adequacy from that point of view. We investigate this issue in large samples by looking at a model credibility index, which is designed to serve as a one-number summary measure of model adequacy. We define the index to be the maximum sample size at which samples from the model and those from the true data generating mechanism are nearly indistinguishable. We use standard notions from hypothesis testing to make this definition precise. We use data subsampling to estimate the index. We show that the definition leads us to some new ways of viewing models as flawed but useful.…
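The subsampling idea in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for the example, not the authors' procedure: the choice of a one-sample Kolmogorov-Smirnov test, the asymptotic 5% critical value 1.358/sqrt(n), the rejection-rate cutoff of 0.5, the grid of subsample sizes, and all function names.

```python
import math
import random


def normal_cdf(x, mu=0.0, sigma=1.0):
    """CDF of a normal distribution, used here as the candidate model."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def ks_statistic(sample, cdf):
    """One-sample Kolmogorov-Smirnov distance between a sample and a model CDF."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs):
        f = cdf(x)
        d = max(d, (i + 1) / n - f, f - i / n)
    return d


def rejection_rate(data, cdf, n, reps, rng, crit=1.358):
    """Fraction of size-n subsamples (drawn without replacement) that reject the model.

    crit / sqrt(n) is the asymptotic 5% critical value of the KS test (assumed level).
    """
    threshold = crit / math.sqrt(n)
    rejections = 0
    for _ in range(reps):
        sub = rng.sample(data, n)
        if ks_statistic(sub, cdf) > threshold:
            rejections += 1
    return rejections / reps


def estimate_credibility_index(data, cdf, sizes, reps=200, power_cutoff=0.5, seed=0):
    """Largest grid size n such that the rejection rate stays at or below the cutoff.

    Returns 0 if the model is already rejected too often at the smallest size.
    """
    rng = random.Random(seed)
    index = 0
    for n in sorted(sizes):
        if rejection_rate(data, cdf, n, reps, rng) > power_cutoff:
            break
        index = n
    return index


if __name__ == "__main__":
    # Hypothetical data: the "truth" is N(0.2, 1), the model under assessment is N(0, 1).
    gen = random.Random(1)
    data = [gen.gauss(0.2, 1.0) for _ in range(5000)]
    sizes = [25, 50, 100, 200, 400]
    print(estimate_credibility_index(data, normal_cdf, sizes))
```

The estimate is only as fine as the grid of sizes, and every subsample size must be well below the number of observations for the subsamples to behave like fresh draws from the data-generating mechanism.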
