Unsupervised Feature Selection for Tumor Profiles using Autoencoders and   Kernel Methods

Martin Palazzo; Pierre Beauseroy; Patricio Yankilevich

arXiv:2007.06106·cs.LG·July 14, 2020

Unsupervised Feature Selection for Tumor Profiles using Autoencoders and Kernel Methods

Martin Palazzo, Pierre Beauseroy, Patricio Yankilevich

PDF

TL;DR

This paper introduces LKFS, an unsupervised method combining autoencoders and kernel learning to select relevant genes and discover tumor subtypes from high-dimensional gene expression data without labels.

Contribution

The paper presents LKFS, a novel unsupervised gene selection technique that improves clustering of tumor subtypes by preserving biological signatures and reducing feature redundancy.

Findings

01

Lower redundancy in selected features

02

Improved clustering performance

03

Effective across multiple tumor datasets

Abstract

Molecular data from tumor profiles is high dimensional. Tumor profiles can be characterized by tens of thousands of gene expression features. Due to the size of the gene expression feature set machine learning methods are exposed to noisy variables and complexity. Tumor types present heterogeneity and can be subdivided in tumor subtypes. In many cases tumor data does not include tumor subtype labeling thus unsupervised learning methods are necessary for tumor subtype discovery. This work aims to learn meaningful and low dimensional representations of tumor samples and find tumor subtype clusters while keeping biological signatures without using tumor labels. The proposed method named Latent Kernel Feature Selection (LKFS) is an unsupervised approach for gene selection in tumor gene expression profiles. By using Autoencoders a low dimensional and denoised latent space is learned as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFeature Selection