Gaussian Process Regression and Classification using International Classification of Disease Codes as Covariates
Sanvesh Srivastava, Zongyi Xu, Yunyi Li, W. Nick Street, Stephanie, Gilbertson-White

TL;DR
This paper introduces a novel Gaussian process framework utilizing a string kernel for measuring similarity between ICD code sets, enabling improved automated prediction of biomedical responses from electronic health records.
Contribution
It develops a new similarity measure for ICD code sets and integrates it into Gaussian process models for nonparametric regression and classification in healthcare data.
Findings
Outperforms competitors in classifying primary cancer sites.
Provides better sensitivity and specificity in predictions.
Estimates associations between chronic conditions and cancer types.
Abstract
International Classification of Disease (ICD) codes are widely used for encoding diagnoses in electronic health records (EHR). Automated methods have been developed over the years for predicting biomedical responses using EHR that borrow information among diagnostically similar patients. Relatively less attention has been paid to developing patient similarity measures that model the structure of ICD codes and the presence of multiple chronic conditions, where a chronic condition is defined as a set of ICD codes. Motivated by this problem, we first develop a type of string kernel function for measuring similarity between a pair of subsets of ICD codes, which uses the definition of chronic conditions. Second, we extend this similarity measure to define a family of covariance functions on subsets of ICD codes. Using this family, we develop Gaussian process (GP) priors for Bayesian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDietary Effects on Health
