Analysis of Two-Phase Studies using Generalized Method of Moments
Prosenjit Kundu, Nilanjan Chatterjee

TL;DR
This paper introduces a novel generalized method of moments approach for analyzing two-phase epidemiological studies, enabling more efficient use of phase-I data without relying on stratification, and demonstrates its effectiveness through simulations and real data application.
Contribution
It develops a new GMM-based method that uses reduced logistic models for phase-I data, improving efficiency over existing stratification-based methods.
Findings
More efficient data utilization with reduced models
Simulation results show improved estimator performance
Application to Wilms Tumor data illustrates practical utility
Abstract
Two-phase design can reduce the cost of epidemiological studies by limiting the ascertainment of expensive covariates or/and exposures to an efficiently selected subset (phase-II) of a larger (phase-I) study. Efficient analysis of the resulting dataset combining disparate information from phase-I and phase-II, however, can be complex. Most of the existing methods including semiparametric maximum-likelihood estimator, require the information in phase-I to be summarized into a fixed number of strata. In this paper, we describe a novel method for analysis of two-phase studies where information from phase-I is summarized by parameters associated with a reduced logistic regression model of the disease outcome on available covariates. We then setup estimating equations for parameters associated with the desired extended logistic regression model, based on information on the reduced model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference
