Constrained tensor factorization for computational phenotyping and mortality prediction in patients with cancer
Francisco Y Cai, Chengsheng Mao, Yuan Luo

TL;DR
This study applies constrained tensor factorization to EHR data of cancer patients to derive interpretable phenotypes and improve five-year mortality prediction, especially by incorporating social determinants of health.
Contribution
It introduces a constrained tensor factorization approach that integrates supervised terms and SDOH covariates to enhance phenotyping and mortality prediction in cancer patients.
Findings
Filtering by medical indication improves phenotype interpretability.
Prediction performance varies by cancer type and experimental conditions.
Incorporating SDOH covariates enhances mortality prediction accuracy.
Abstract
Background: The increasing adoption of electronic health records (EHR) across the US has created troves of computable data, to which machine learning methods have been applied to extract useful insights. EHR data, represented as a three-dimensional analogue of a matrix (tensor), is decomposed into two-dimensional factors that can be interpreted as computational phenotypes. Methods: We apply constrained tensor factorization to derive computational phenotypes and predict mortality in cohorts of patients with breast, prostate, colorectal, or lung cancer in the Northwestern Medicine Enterprise Data Warehouse from 2000 to 2015. In our experiments, we examined using a supervised term in the factorization algorithm, filtering tensor co-occurrences by medical indication, and incorporating additional social determinants of health (SDOH) covariates in the factorization process. We evaluated the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Health, Environment, Cognitive Aging · Cardiovascular Health and Risk Factors
