Valid and Efficient Two-Stage Latent Subgroup Analysis with Observational Data
Yuanhui Luo, Xinzhou Guo, Yuqi Gu

TL;DR
This paper develops a robust, efficient two-stage method for latent subgroup analysis in observational data, addressing bias from misclassification and high-dimensional confounders, with proven consistency and practical validation.
Contribution
It introduces a spectral method-based two-stage approach that achieves valid subgroup analysis despite misclassification, high-dimensional confounders, and noninformative items.
Findings
Method achieves consistent estimation of subgroup effects.
Approach is computationally efficient and robust.
Validated through simulations and real educational data.
Abstract
Subgroup analysis evaluates treatment effects across multiple sub-populations. When subgroups are defined by latent memberships inferred from imperfect measurements, the analysis typically involves two inter-connected models, a latent class model and a subgroup outcome model. The classical one-stage framework, which models the joint distribution of the two models, may be infeasible with observational data containing many confounders. The two-stage framework, which first estimates the latent class model and then performs subgroup analysis using estimated latent memberships, can accommodate potential confounders but may suffer from bias issues due to misclassification of latent subgroup memberships. This paper focuses on latent subgroups inferred from binary item responses and addresses when and how a valid two-stage latent subgroup analysis can be made with observational data. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPsychometric Methodologies and Testing · Advanced Causal Inference Techniques · Statistical Methods and Bayesian Inference
