Self-Normalizing Foundation Model for Enhanced Multi-Omics Data Analysis in Oncology
Asim Waqas, Aakash Tripathi, Sabeen Ahmed, Ashwin Mukund, Hamza, Farooq, Matthew B. Schabath, Paul Stewart, Mia Naeini, Ghulam Rasool

TL;DR
SeNMo is a self-normalizing foundation model trained on multi-omics data across 33 cancer types, effectively predicting patient survival, cancer type, and lymph structure, demonstrating robustness and generalizability in oncology data analysis.
Contribution
This work introduces SeNMo, a novel foundation model specifically designed for multi-omics cancer data, improving prediction accuracy and handling high-dimensional data efficiently.
Findings
SeNMo achieved a C-Index of 0.76 on survival prediction.
SeNMo classified primary cancer type with 99.8% accuracy.
SeNMo effectively predicted tertiary lymph structures across cancer types.
Abstract
Multi-omics research has enhanced our understanding of cancer heterogeneity and progression. Investigating molecular data through multi-omics approaches is crucial for unraveling the complex biological mechanisms underlying cancer, thereby enabling more effective diagnosis, treatment, and prevention strategies. However, predicting patient outcomes through the integration of all available multi-omics data is still an under-study research direction. Here, we present SeNMo, a foundation model that has been trained on multi-omics data across 33 cancer types. SeNMo is particularly efficient in handling multi-omics data characterized by high-width and low-length attributes. We trained SeNMo for the task of overall survival of patients using pan-cancer multi-omics data involving 33 cancer sites from the GDC. The training multi-omics data includes gene expression, DNA methylation, miRNA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Genetics, Bioinformatics, and Biomedical Research · Metabolomics and Mass Spectrometry Studies
