Graph Neural Networks for Breast Cancer Data Integration
Teodora Reu

TL;DR
This paper introduces a novel pipeline using Graph Neural Networks to integrate multigenomic and clinical breast cancer data into graph representations, enabling improved cancer subtype classification and data understanding.
Contribution
It proposes a new method for integrating heterogeneous cancer data as graphs and applying GNNs for unsupervised embedding generation, enhancing analysis of complex biomedical datasets.
Findings
Achieved up to 98% accuracy on synthetic data
Achieved up to 80% accuracy on METABRIC data
Demonstrated influence of data homophily on pipeline performance
Abstract
International initiatives such as METABRIC (Molecular Taxonomy of Breast Cancer International Consortium) have collected several multigenomic and clinical data sets to identify the undergoing molecular processes taking place throughout the evolution of various cancers. Numerous Machine Learning and statistical models have been designed and trained to analyze these types of data independently, however, the integration of such differently shaped and sourced information streams has not been extensively studied. To better integrate these data sets and generate meaningful representations that can ultimately be leveraged for cancer detection tasks could lead to giving well-suited treatments to patients. Hence, we propose a novel learning pipeline comprising three steps - the integration of cancer data modalities as graphs, followed by the application of Graph Neural Networks in an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Gene expression and cancer classification · AI in cancer detection
