Robustness of semiparametric efficiency in nearly-true models for two-phase samples
Thomas Lumley

TL;DR
This paper investigates how nearly-true models affect the efficiency of estimators in two-phase sampling, revealing that slight misspecifications can negate efficiency gains and impact estimator bias and variance.
Contribution
It provides a theoretical analysis of the robustness of semiparametric efficiency under nearly-true models in two-phase sampling, supported by simulations.
Findings
Efficient estimators can have bias comparable to AIPW estimators under slight misspecification.
The mean squared error of efficient estimators may no longer be lower than AIPW in nearly-true models.
Simulation studies confirm the theoretical results across different sampling and modeling scenarios.
Abstract
We examine the performance of efficient and AIPW estimators under two-phase sampling when the complete-data model is nearly correctly specified, in the sense that the misspecification is not reliably detectable from the data by any possible diagnostic or test. Asymptotic results for these nearly true models are obtained by representing them as sequences of misspecified models that are mutually contiguous with a correctly specified model. We find that for the least-favourable direction of model misspecification the bias in the efficient estimator induced can be comparable to the extra variability in the AIPW estimator, so that the mean squared error of the efficient estimator is no longer lower. This can happen when the most-powerful test for the model misspecification still has modest power. We verify that the theoretical results agree with simulation in three examples: a simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Process Monitoring · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
