Feature selection for longitudinal microarray data by adapting a pathway analysis method
Suyan Tian, Chi Wang, Howard H. Chang

TL;DR
This paper adapts a gene set analysis method, SAMGSR, for feature selection in longitudinal microarray data, demonstrating its effectiveness in identifying relevant genes and producing parsimonious models.
Contribution
It introduces simple and two-level SAMGSR extensions for feature selection, bridging gene set analysis and feature selection in longitudinal microarray studies.
Findings
Both SAMGSR extensions perform well on real data.
They effectively identify true relevant genes in simulated data.
The models are parsimonious when relevant genes are not highly correlated with irrelevant ones.
Abstract
Introduction: Feature selection and gene set analysis are of increasing interest in bioinformatics. While these two approaches have been developed for different purposes, we describe how some gene set analysis methods can be used to conduct feature selection. Here we adapt the gene set analysis method, significance analysis of microarray gene set reduction (SAMGSR), for feature selection, and propose two extensions-simple SAMGSR and two-level SAMGSR to identify relevant features for longitudinal microarray data. Results and Discussion: When applied to a real-world application, both simple and two-level SAMGSR work comparably well. Using simulated data, we further demonstrate that both SAMGSR extensions have the ability to identify the true relevant genes. If the relevant genes are not highly correlated with the irrelevant ones, the final models given by the two SAMGSR extensions are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Genetic Mapping and Diversity in Plants and Animals
