Exact $p$-values for global network alignments via combinatorial analysis of shared GO terms (Subtitle: REFANGO: Rigorous Evaluation of Functional Alignments of Networks using Gene Ontology)
Wayne B. Hayes

TL;DR
This paper introduces a rigorous combinatorial method to compute exact p-values for shared GO terms in global network alignments, enabling statistically sound evaluation of functional similarity in PPI network comparisons.
Contribution
It presents a novel, mathematically rigorous approach to evaluate the significance of shared GO terms in network alignments using exact combinatorial p-values.
Findings
Provides explicit formulas for p-values of shared GO terms
Enables statistically rigorous evaluation of network alignments
Offers a new standard for assessing functional similarity in PPI networks
Abstract
Network alignment aims to uncover topologically similar regions in the protein-protein interaction (PPI) networks of two or more species under the assumption that topologically similar regions tend to perform similar functions. Although there exist a plethora of both network alignment algorithms and measures of topological similarity, currently no gold standard exists for evaluating how well either is able to uncover functionally similar regions. Here we propose a formal, mathematically and statistically rigorous method for evaluating the statistical significance of shared GO terms in a global, 1-to-1 alignment between two PPI networks. We use combinatorics to precisely count the number of possible network alignments in which proteins share a particular GO term. When divided by the number of all possible network alignments, this provides an explicit, exact -value for a network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Gene Regulatory Network Analysis · Computational Drug Discovery Methods
