Multi-study Integration of Brain Cancer Transcriptomes Reveals Organ-Level Molecular Signatures
Jaeyun Sung, Pan-Jun Kim, Shuyi Ma, Cory C. Funk, Andrew T. Magis,, Yuliang Wang, Leroy Hood, Donald Geman, and Nathan D. Price

TL;DR
This study develops a new method to identify molecular signatures for brain cancers using integrated transcriptomic data from multiple studies, achieving high prediction accuracy and addressing data heterogeneity.
Contribution
A novel approach for integrating multi-study transcriptomic data to derive robust brain cancer signatures with improved reproducibility and predictive performance.
Findings
Developed a 44-gene brain cancer marker panel.
Achieved 90% accuracy in phenotype prediction.
Meta-signatures improve robustness across datasets.
Abstract
We utilized abundant transcriptomic data for the primary classes of brain cancers to study the feasibility of separating all of these diseases simultaneously based on molecular data alone. These signatures were based on a new method reported herein that resulted in a brain cancer marker panel of 44 unique genes. Many of these genes have established relevance to the brain cancers examined, with others having known roles in cancer biology. Analyses on large-scale data from multiple sources must deal with significant challenges associated with heterogeneity between different published studies, for it was observed that the variation among individual studies often had a larger effect on the transcriptome than did phenotype differences, as is typical. We found that learning signatures across multiple datasets greatly enhanced reproducibility and accuracy in predictive performance on truly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
