Node similarity within subgraphs of protein interaction networks
Orion Penner, Vishal Sood, Gabe Musso, Kim Baskerville, Peter, Grassberger, Maya Paczuski

TL;DR
This paper introduces a new measure called twinness to evaluate local similarity between nodes in protein interaction networks, revealing biological insights and ancestral relationships.
Contribution
The study proposes twinness as a novel, biologically motivated metric for assessing node similarity in protein interaction networks, and demonstrates its effectiveness in identifying paralogous proteins and evolutionary relationships.
Findings
Twinness is significantly higher than null model expectations.
Type A and B twin ratios distinguish prokaryotes from eukaryotes.
Paralogous proteins are over-represented as twins.
Abstract
We propose a biologically motivated quantity, twinness, to evaluate local similarity between nodes in a network. The twinness of a pair of nodes is the number of connected, labeled subgraphs of size n in which the two nodes possess identical neighbours. The graph animal algorithm is used to estimate twinness for each pair of nodes (for subgraph sizes n=4 to n=12) in four different protein interaction networks (PINs). These include an Escherichia coli PIN and three Saccharomyces cerevisiae PINs -- each obtained using state-of-the-art high throughput methods. In almost all cases, the average twinness of node pairs is vastly higher than expected from a null model obtained by switching links. For all n, we observe a difference in the ratio of type A twins (which are unlinked pairs) to type B twins (which are linked pairs) distinguishing the prokaryote E. coli from the eukaryote S.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
