On the global identifiability of logistic regression models with misclassified outcomes
Rui Duan, Yang Ning, Jiasheng Shi, Raymond J Carroll, Tianxi Cai and, Yong Chen

TL;DR
This paper establishes necessary and sufficient conditions for the global identifiability of logistic regression models with misclassified outcomes, providing practical tools for biomedical research involving electronic health records.
Contribution
It introduces a novel submodel analysis approach and applies differential equation techniques to determine global identifiability in misclassified logistic models, especially with discrete covariates.
Findings
Derived clear conditions for model identifiability
Proposed hypothesis testing methods for non-identifiable models
Applicable to common biomedical data scenarios
Abstract
In the last decade, the secondary use of large data from health systems, such as electronic health records, has demonstrated great promise in advancing biomedical discoveries and improving clinical decision making. However, there is an increasing concern about biases in association studies caused by misclassification in the binary outcomes derived from electronic health records. We revisit the classical logistic regression model with misclassified outcomes. Despite that local identification conditions in some related settings have been previously established, the global identification of such models remains largely unknown and is an important question yet to be answered. We derive necessary and sufficient conditions for global identifiability of logistic regression models with misclassified outcomes, using a novel approach termed as the submodel analysis, and a technique adapted from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Inference · Statistical Distribution Estimation and Applications
