A semiparametric two-sample homogeneity test with nonignorable nonresponse using callback data
Xinyu Wang, Tao Yu, Chunlin Wang, Pengfei Li

TL;DR
This paper introduces a semiparametric method for testing the homogeneity of two distributions in surveys with nonignorable nonresponse, leveraging callback data to improve accuracy and power.
Contribution
It develops a novel empirical likelihood ratio test within a semiparametric framework that accounts for nonignorable nonresponse using callback data.
Findings
The proposed test controls type I error effectively.
It achieves higher power than existing methods ignoring nonignorable missingness.
Application to real survey data demonstrates practical utility.
Abstract
Testing the homogeneity of two distributions is fundamental in statistics, but classical procedures may fail under nonignorable nonresponse. In many surveys, callback data record repeated contact attempts and provide auxiliary information about the response mechanism. We develop a semiparametric framework for two-sample homogeneity testing that explicitly incorporates such information. The response mechanism is modeled by a flexible semiparametric callback model, while the two population distributions are linked through a density ratio model. Within this unified framework, we propose an empirical likelihood ratio test for distributional homogeneity and show that, under the null hypothesis, it has a Wilks-type chi-square limit. To facilitate computation, we develop an efficient expectation-maximization-type algorithm. Simulation results show that the proposed method controls type I error…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
