TL;DR
This paper investigates how exposure misclassification affects the analysis of cluster size data, especially when cluster size is related to outcomes, and proposes methods to correct bias in such settings.
Contribution
It introduces two frameworks for bias correction in the presence of exposure misclassification and informative cluster size, applicable to marginal and conditional parameter estimation.
Findings
Misclassification can induce or obscure informativeness of cluster size.
Differential misclassification can bias, inflate, or reverse effect estimates.
Proposed methods improve bias correction in real-world epidemiological data.
Abstract
In this paper we study the impact of exposure misclassification when cluster size is potentially informative (i.e., related to outcomes) and when misclassification is differential by cluster size. First, we show that misclassification in an exposure related to cluster size can induce informativeness when cluster size would otherwise be non-informative. Second, we show that misclassification that is differential by informative cluster size can not only attenuate estimates of exposure effects but even inflate or reverse the sign of estimates. To correct for bias in estimating marginal parameters, we propose two frameworks: (i) an observed likelihood approach for joint marginalized models of cluster size and outcomes and (ii) an expected estimating equations approach. Although we focus on estimating marginal parameters, a corollary is that the observed likelihood approach permits valid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
