Data-driven goodness-of-fit tests

Mikhail Langovoy

arXiv:0708.0169·math.ST·September 22, 2017

Data-driven goodness-of-fit tests

Mikhail Langovoy

PDF

Open Access

TL;DR

This paper introduces a versatile, data-driven framework for constructing consistent statistical goodness-of-fit tests that incorporate model selection and penalization, applicable to various complex testing scenarios.

Contribution

It develops a general method unifying multiple test types with model selection, providing theoretical guarantees and broad applicability to inverse, multi-sample, and nonparametric testing.

Findings

01

The tests are consistent under null and alternative hypotheses.

02

Explicit detectability rules for alternative hypotheses are derived.

03

The method applies to inverse problems, multi-sample, and nonparametric tests.

Abstract

We propose and study a general method for construction of consistent statistical tests on the basis of possibly indirect, corrupted, or partially available observations. The class of tests devised in the paper contains Neyman's smooth tests, data-driven score tests, and some types of multi-sample tests as basic examples. Our tests are data-driven and are additionally incorporated with model selection rules. The method allows to use a wide class of model selection rules that are based on the penalization idea. In particular, many of the optimal penalties, derived in statistical literature, can be used in our tests. We establish the behavior of model selection rules and data-driven tests under both the null hypothesis and the alternative hypothesis, derive an explicit detectability rule for alternative hypotheses, and prove a master consistency theorem for the tests from the class. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Advanced Statistical Process Monitoring