Multi-Omic Data Integration and Feature Selection for Survival-based Patient Stratification via Supervised Concrete Autoencoders
Pedro Henrique da Costa Avelar, Roman Laddach, Sophia Karagiannis, Min, Wu, Sophia Tsoka

TL;DR
This paper introduces a novel supervised autoencoder model for integrating multi-omic data to improve cancer patient survival prediction and feature interpretability, outperforming existing methods.
Contribution
The paper proposes a new Concrete Supervised Autoencoder model that enhances survival prediction and feature selection stability in multi-omics data integration.
Findings
Models outperform or match common baselines in survival separation.
CSAE provides more interpretable features.
Feature selection stability follows a power-law distribution.
Abstract
Cancer is a complex disease with significant social and economic impact. Advancements in high-throughput molecular assays and the reduced cost for performing high-quality multi-omics measurements have fuelled insights through machine learning . Previous studies have shown promise on using multiple omic layers to predict survival and stratify cancer patients. In this paper, we developed a Supervised Autoencoder (SAE) model for survival-based multi-omic integration which improves upon previous work, and report a Concrete Supervised Autoencoder model (CSAE), which uses feature selection to jointly reconstruct the input features as well as predict survival. Our experiments show that our models outperform or are on par with some of the most commonly used baselines, while either providing a better survival separation (SAE) or being more interpretable (CSAE). We also perform a feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Cancer Genomics and Diagnostics
MethodsFeature Selection
