Prior Adaptive Semi-supervised Learning with Application to EHR Phenotyping
Yichi Zhang, Molei Liu, Matey Neykov, Tianxi Cai

TL;DR
This paper introduces PASS, a semi-supervised learning method that leverages both labeled and unlabeled EHR data with surrogate variables to improve disease phenotyping accuracy, especially in high-dimensional settings.
Contribution
The paper proposes a novel prior adaptive semi-supervised estimator for EHR phenotyping that effectively utilizes surrogate variables and unlabeled data, with theoretical guarantees and practical validation.
Findings
PASS outperforms existing methods in simulations.
The method demonstrates improved phenotyping accuracy on rheumatoid arthritis data.
Asymptotic theory supports the estimator's consistency and efficiency.
Abstract
Electronic Health Records (EHR) data, a rich source for biomedical research, have been successfully used to gain novel insight into a wide range of diseases. Despite its potential, EHR is currently underutilized for discovery research due to it's major limitation in the lack of precise phenotype information. To overcome such difficulties, recent efforts have been devoted to developing supervised algorithms to accurately predict phenotypes based on relatively small training datasets with gold standard labels extracted via chart review. However, supervised methods typically require a sizable training set to yield generalizable algorithms especially when the number of candidate features, , is large. In this paper, we propose a semi-supervised (SS) EHR phenotyping method that borrows information from both a small labeled data where both the label and the feature set are observed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Colorectal Cancer Screening and Detection · AI in cancer detection
