An end-to-end framework for gene expression classification by integrating a background knowledge graph: application to cancer prognosis prediction
Kazuma Inoue, Ryosuke Kojima, Mayumi Kamada, Yasushi Okuno

TL;DR
This paper presents an end-to-end framework that integrates background biological knowledge graphs with gene expression data to improve cancer prognosis prediction accuracy, identifying key biomarkers and pathways.
Contribution
The study introduces a novel framework that combines secondary biological data with primary gene expression data for enhanced classification in cancer prognosis prediction.
Findings
Higher accuracy than models without background knowledge
Improved ROC-AUC in multiple cancer types
Identification of known and novel biomarkers
Abstract
Biological data may be separated into primary data, such as gene expression, and secondary data, such as pathways and protein-protein interactions. Methods using secondary data to enhance the analysis of primary data are promising, because secondary data have background information that is not included in primary data. In this study, we proposed an end-to-end framework to integrally handle secondary data to construct a classification model for primary data. We applied this framework to cancer prognosis prediction using gene expression data and a biological network. Cross-validation results indicated that our model achieved higher accuracy compared with a deep neural network model without background biological network information. Experiments conducted in patient groups by cancer type showed improvement in ROC-area under the curve for many groups. Visualizations of high accuracy cancer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Machine Learning in Bioinformatics
