Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning
Jianzhu Ma, Sheng Wang, Zhiyong Wang, Jinbo Xu

TL;DR
This paper introduces a novel protein contact prediction method that combines joint evolutionary coupling analysis across related protein families with supervised learning, significantly improving prediction accuracy over existing techniques.
Contribution
The paper presents a group graphical lasso framework that integrates multi-family EC analysis with supervised learning, leveraging shared co-evolution patterns and diverse information sources.
Findings
Outperforms existing contact prediction methods in accuracy.
Effective on both conserved and family-specific contacts.
Utilizes joint EC analysis across related protein families.
Abstract
Protein contacts contain important information for protein structure and functional study, but contact prediction from sequence remains very challenging. Both evolutionary coupling (EC) analysis and supervised machine learning methods are developed to predict contacts, making use of different types of information, respectively. This paper presents a group graphical lasso (GGL) method for contact prediction that integrates joint multi-family EC analysis and supervised learning. Different from existing single-family EC analysis that uses residue co-evolution information in only the target protein family, our joint EC analysis uses residue co-evolution in both the target family and its related families, which may have divergent sequences but similar folds. To implement joint EC analysis, we model a set of related protein families using Gaussian graphical models (GGM) and then co-estimate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Bioinformatics and Genomic Networks · RNA and protein synthesis mechanisms
