Leveraging a Joint of Phenotypic and Genetic Features on Cancer Patient Subgrouping
David Oniani, Chen Wang, Yiqing Zhao, Andrew Wen, Hongfang Liu,, Feichen Shen

TL;DR
This paper presents a novel system that combines phenotypic and genetic data from electronic health records and genetic tests to improve cancer patient subgrouping through classification and clustering techniques.
Contribution
It introduces an integrated approach leveraging joint phenotypic and genetic features for more accurate cancer patient stratification, utilizing multiple machine learning models and graph-based embeddings.
Findings
Effective feature filtering enhances model performance.
Multiple machine learning models achieve high classification accuracy.
Graph-based embeddings improve patient clustering quality.
Abstract
Cancer is responsible for millions of deaths worldwide every year. Although significant progress has been achieved in cancer medicine, many issues remain to be addressed for improving cancer therapy. Appropriate cancer patient stratification is the prerequisite for selecting appropriate treatment plan, as cancer patients are of known heterogeneous genetic make-ups and phenotypic differences. In this study, built upon deep phenotypic characterizations extractable from Mayo Clinic electronic health records (EHRs) and genetic test reports for a collection of cancer patients, we developed a system leveraging a joint of phenotypic and genetic features for cancer patient subgrouping. The workflow is roughly divided into three parts: feature preprocessing, cancer patient classification, and cancer patient clustering based. In feature preprocessing step, we performed filtering, retaining the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · AI in cancer detection · Radiomics and Machine Learning in Medical Imaging
MethodsLogistic Regression
