Gene selection for cancer classification using a hybrid of univariate   and multivariate feature selection methods

Min Xu; Rudy Setiono

arXiv:1506.02085·q-bio.QM·June 18, 2015·5 cites

Gene selection for cancer classification using a hybrid of univariate and multivariate feature selection methods

Min Xu, Rudy Setiono

PDF

Open Access

TL;DR

This paper introduces a hybrid gene selection method combining univariate and multivariate techniques to improve cancer classification accuracy with fewer genes, addressing limitations of existing methods.

Contribution

A novel hybrid gene selection approach combining LIK and RFE methods that yields smaller, more discriminative gene sets with comparable or better accuracy.

Findings

01

Fewer genes needed for accurate classification

02

Effective on leukemia and tumor datasets

03

Outperforms existing gene selection methods

Abstract

Various approaches to gene selection for cancer classification based on microarray data can be found in the literature and they may be grouped into two categories: univariate methods and multivariate methods. Univariate methods look at each gene in the data in isolation from others. They measure the contribution of a particular gene to the classification without considering the presence of the other genes. In contrast, multivariate methods measure the relative contribution of a gene to the classification by taking the other genes in the data into consideration. Multivariate methods select fewer genes in general. However, the selection process of multivariate methods may be sensitive to the presence of irrelevant genes, noises in the expression and outliers in the training data. At the same time, the computational cost of multivariate methods is high. To overcome the disadvantages of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGene expression and cancer classification · Machine Learning in Bioinformatics · Bioinformatics and Genomic Networks