Goodness-of-Fit Tests for Censored and Truncated Data: Maximum Mean Discrepancy Over Regular Functionals
Juan Carlos Escanciano, Jacobo de U\~na-\'Alvarez

TL;DR
This paper introduces a new goodness-of-fit testing method for censored and truncated data using a maximum-mean-discrepancy approach, overcoming computational issues of traditional methods and applicable to various incomplete-data scenarios.
Contribution
It develops a regular, omnibus goodness-of-fit test based on a reproducing kernel Hilbert space, suitable for complex censored and truncated data models, including the first test for random double truncation.
Findings
The proposed test maintains size and power in simulations.
It provides a practical bootstrap method for critical value computation.
The method is demonstrated on real incomplete-data examples.
Abstract
We develop a systematic, omnibus approach to goodness-of-fit testing for parametric distributional models when the variable of interest is only partially observed due to censoring and/or truncation. In many such designs, tests based on the nonparametric maximum likelihood estimator are hindered by nonexistence, computational instability, or convergence rates too slow to support reliable calibration under composite nulls. We avoid these difficulties by constructing a regular (pathwise differentiable) Neyman-orthogonal score process indexed by test functions, and aggregating it over a reproducing kernel Hilbert space ball. This yields a maximum-mean-discrepancy-type supremum statistic with a convenient quadratic-form representation. Critical values are obtained via a multiplier bootstrap that keeps nuisance estimates fixed. We establish asymptotic validity under the null and local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Causal Inference Techniques · Statistical Distribution Estimation and Applications
