Large-scale identification of coevolution signals across homo-oligomeric protein interfaces by Direct Coupling Analysis
Guido Uguzzoni, Shalini John Lovis, Francesco Oteri, Alexander Schug,, Hendrik Szurmant, Martin Weigt

TL;DR
This study uses Direct Coupling Analysis on large sequence datasets to identify coevolution signals in homo-oligomeric protein interfaces, enabling accurate structural modeling of protein complexes.
Contribution
It systematically applies DCA to nearly 2000 protein families, demonstrating its effectiveness in predicting interfaces and subfamily-specific binding modes.
Findings
Large interfaces are often identified by DCA.
DCA differentiates subfamilies with distinct binding modes.
Sequence-based contact info enables accurate oligomer structure prediction.
Abstract
Proteins have evolved to perform diverse cellular functions, from serving as reaction catalysts to coordinating cellular propagation and development. Frequently, proteins do not exert their full potential as monomers but rather undergo concerted interactions as either homo-oligomers or with other proteins as hetero-oligomers. The experimental study of such protein complexes and interactions has been arduous. Theoretical structure prediction methods are an attractive alternative. Here, we investigate homo-oligomeric interfaces by tracing residue coevolution via the global statistical Direct Coupling Analysis (DCA). DCA can accurately infer spatial adjacencies between residues. These adjacencies can be included as constraints in structure-prediction techniques to predict high-resolution models. By taking advantage of the on-going exponential growth of sequence databases, we go…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
