Correlations from structure and phylogeny combine constructively in the inference of protein partners from sequences
Andonis Gerardos, Nicola Dietler, Anne-Florence Bitbol

TL;DR
This study demonstrates that correlations from structural contacts and phylogeny synergistically improve protein partner inference from sequences, with phylogeny providing crucial signals especially when contact information is limited.
Contribution
The paper introduces a minimal model to analyze how structural and phylogenetic correlations jointly enhance protein partner inference, highlighting the constructive combination of these signals.
Findings
Correlations from structure and phylogeny combine constructively for better inference.
Phylogeny can compensate for limited structural contact information.
Non-contact site pairs contribute positively to inference performance.
Abstract
Inferring protein-protein interactions from sequences is an important task in computational biology. Recent methods based on Direct Coupling Analysis (DCA) or Mutual Information (MI) allow to find interaction partners among paralogs of two protein families. Does successful inference mainly rely on correlations from structural contacts or from phylogeny, or both? Do these two types of signal combine constructively or hinder each other? To address these questions, we generate and analyze synthetic data produced using a minimal model that allows us to control the amounts of structural constraints and phylogeny. We show that correlations from these two sources combine constructively to increase the performance of partner inference by DCA or MI. Furthermore, signal from phylogeny can rescue partner inference when signal from contacts becomes less informative, including in the realistic case…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBioinformatics and Genomic Networks · Plant and animal studies
