Integrative analysis of gene expression and phenotype data
Min Xu

TL;DR
This paper presents three innovative methods for integrating gene expression and phenotype data, including automated phenotype profiling, robust gene subset selection for classification, and network module discovery, advancing understanding of complex traits like cancer.
Contribution
It introduces new approaches for phenotype profiling, gene subset selection, and network module identification, addressing challenges in high-dimensional, small-sample data analysis.
Findings
Robust multi-dimensional phenotype profiling using gene expression similarity.
Improved classification accuracy with sequences of discriminative gene clusters.
Identification of cancer subtype-specific gene network modules, including a potential tumor suppressor.
Abstract
The linking genotype to phenotype is the fundamental aim of modern genetics. We focus on study of links between gene expression data and phenotype data through integrative analysis. We propose three approaches. 1) The inherent complexity of phenotypes makes high-throughput phenotype profiling a very difficult and laborious process. We propose a method of automated multi-dimensional profiling which uses gene expression similarity. Large-scale analysis show that our method can provide robust profiling that reveals different phenotypic aspects of samples. This profiling technique is also capable of interpolation and extrapolation beyond the phenotype information given in training data. It can be used in many applications, including facilitating experimental design and detecting confounding factors. 2) Phenotype association analysis problems are complicated by small sample size and high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Gene Regulatory Network Analysis
