Weighted Kolmogorov Smirnov testing: an alternative for Gene Set Enrichment Analysis
Konstantina Charmpi (LJK), Bernard Ycart (LJK)

TL;DR
This paper introduces the Weighted Kolmogorov Smirnov (WKS) test as an alternative to classical GSEA, offering better mathematical properties and potentially more informative results in genomic data analysis.
Contribution
The paper proposes a new test statistic for GSEA with proven convergence and computational advantages, enhancing gene set enrichment analysis.
Findings
WKS test has a well-defined asymptotic distribution.
WKS reduces computation time compared to classical GSEA.
WKS may provide more informative insights in genomic studies.
Abstract
Gene Set Enrichment Analysis (GSEA) is a basic tool for genomic data treatment. From a statistical point of view, the centering of its test statistic does not allow the derivation of asymptotic results. A test statistic with a different centering is proposed. Under the null hypothesis, the convergence in distribution of the new test statistic is proved, using the theory of empirical processes. The limiting distribution can be computed by Monte-Carlo simulation. The test defined in this way has been called Weighted Kolmogorov Smirnov (WKS) test. The fact that the evaluation of the asymptotic distribution serves for many different gene sets results in shorter computing times. Using expression data from the GEO repository, tested against the MSig Database C2, a comparison between the classical GSEA test and the new procedure has been conducted. Our conclusion is that, beyond its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Gene Regulatory Network Analysis · Bioinformatics and Genomic Networks
