SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification
Sayed Hashim, Muhammad Ali, Karthik Nandakumar, Mohammad Yaqub

TL;DR
SubOmiEmbed introduces a self-supervised variational autoencoder approach for integrating high-dimensional multi-omics data, improving cancer type classification by extracting meaningful, biologically relevant features with fewer network parameters.
Contribution
It extends VAE models with self-supervised feature subsetting to enhance multi-omics data embedding for cancer classification, reducing network size and complexity.
Findings
Comparable performance to baseline models with smaller networks
Effective integration of methylation and expression data
Potential for incorporating additional genomic data
Abstract
For personalized medicines, very crucial intrinsic information is present in high dimensional omics data which is difficult to capture due to the large number of molecular features and small number of available samples. Different types of omics data show various aspects of samples. Integration and analysis of multi-omics data give us a broad view of tumours, which can improve clinical decision making. Omics data, mainly DNA methylation and gene expression profiles are usually high dimensional data with a lot of molecular features. In recent years, variational autoencoders (VAE) have been extensively used in embedding image and text data into lower dimensional latent spaces. In our project, we extend the idea of using a VAE model for low dimensional latent space extraction with the self-supervised learning technique of feature subsetting. With VAEs, the key idea is to make the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Genomics and Diagnostics · AI in cancer detection · Gene expression and cancer classification
