Unsupervised Feature Selection for Tumor Profiles using Autoencoders and Kernel Methods
Martin Palazzo, Pierre Beauseroy, Patricio Yankilevich

TL;DR
This paper introduces LKFS, an unsupervised method combining autoencoders and kernel learning to select relevant genes and discover tumor subtypes from high-dimensional gene expression data without labels.
Contribution
The paper presents LKFS, a novel unsupervised gene selection technique that improves clustering of tumor subtypes by preserving biological signatures and reducing feature redundancy.
Findings
Lower redundancy in selected features
Improved clustering performance
Effective across multiple tumor datasets
Abstract
Molecular data from tumor profiles is high dimensional. Tumor profiles can be characterized by tens of thousands of gene expression features. Due to the size of the gene expression feature set machine learning methods are exposed to noisy variables and complexity. Tumor types present heterogeneity and can be subdivided in tumor subtypes. In many cases tumor data does not include tumor subtype labeling thus unsupervised learning methods are necessary for tumor subtype discovery. This work aims to learn meaningful and low dimensional representations of tumor samples and find tumor subtype clusters while keeping biological signatures without using tumor labels. The proposed method named Latent Kernel Feature Selection (LKFS) is an unsupervised approach for gene selection in tumor gene expression profiles. By using Autoencoders a low dimensional and denoised latent space is learned as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
