Redundancy-aware unsupervised ranking based on game theory -- application to gene enrichment analysis
Chiara Balestra, Carlo Maj, Emmanuel Mueller, Andreas Mayr

TL;DR
This paper introduces a redundancy-aware, unsupervised ranking method for gene set collections using game theory, specifically Shapley values, to improve interpretability and reduce redundancy in gene enrichment analysis.
Contribution
It proposes a novel game-theoretic ranking approach that accounts for redundancy among gene sets, enhancing interpretability without increasing false positives.
Findings
Low redundancy in ranked gene sets
High coverage of genes in top-ranked sets
Effective reduction of gene set collection complexity
Abstract
Gene set collections are a common ground to study the enrichment of genes for specific phenotypic traits. Gene set enrichment analysis aims to identify genes that are over-represented in gene sets collections and might be associated with a specific phenotypic trait. However, as this involves a massive number of hypothesis testing, it is often questionable whether a pre-processing step to reduce gene sets collections' sizes is helpful. Moreover, the often highly overlapping gene sets and the consequent low interpretability of gene sets' collections demand for a reduction of the included gene sets. Inspired by this bioinformatics context, we propose a method to rank sets within a family of sets based on the distribution of the singletons and their size. We obtain sets' importance scores by computing Shapley values without incurring into the usual exponential number of evaluations of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Bayesian Modeling and Causal Inference · Advanced Causal Inference Techniques
