Gene selection for cancer classification using a hybrid of univariate and multivariate feature selection methods
Min Xu, Rudy Setiono

TL;DR
This paper introduces a hybrid gene selection method combining univariate and multivariate techniques to improve cancer classification accuracy with fewer genes, addressing limitations of existing methods.
Contribution
A novel hybrid gene selection approach combining LIK and RFE methods that yields smaller, more discriminative gene sets with comparable or better accuracy.
Findings
Fewer genes needed for accurate classification
Effective on leukemia and tumor datasets
Outperforms existing gene selection methods
Abstract
Various approaches to gene selection for cancer classification based on microarray data can be found in the literature and they may be grouped into two categories: univariate methods and multivariate methods. Univariate methods look at each gene in the data in isolation from others. They measure the contribution of a particular gene to the classification without considering the presence of the other genes. In contrast, multivariate methods measure the relative contribution of a gene to the classification by taking the other genes in the data into consideration. Multivariate methods select fewer genes in general. However, the selection process of multivariate methods may be sensitive to the presence of irrelevant genes, noises in the expression and outliers in the training data. At the same time, the computational cost of multivariate methods is high. To overcome the disadvantages of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Machine Learning in Bioinformatics · Bioinformatics and Genomic Networks
