Predicting diverse M-best protein contact maps

Siqi Sun; Jianzhu Ma; Sheng Wang; Jinbo Xu

arXiv:1511.09181·q-bio.QM·December 1, 2015·BIBM

Predicting diverse M-best protein contact maps

Siqi Sun, Jianzhu Ma, Sheng Wang, Jinbo Xu

PDF

Open Access

TL;DR

This paper introduces a novel method to predict multiple diverse protein contact maps from a single sequence, improving accuracy over existing methods especially for proteins with few homologs.

Contribution

It presents a structure learning approach that generates a set of diverse contact maps, outperforming traditional EC methods in accuracy for challenging protein families.

Findings

01

Best out of 5 solutions improves accuracy by at least 0.1 over the first.

02

Method performs better than Evfold and PSICOV on test proteins.

03

Especially effective for proteins with few sequence homologs.

Abstract

Protein contacts contain important information for protein structure and functional study, but contact prediction from sequence information remains very challenging. Recently evolutionary coupling (EC) analysis, which predicts contacts by detecting co-evolved residues (or columns) in a multiple sequence alignment (MSA), has made good progress due to better statistical assessment techniques and high-throughput sequencing. Existing EC analysis methods predict only a single contact map for a given protein, which may have low accuracy especially when the protein under prediction does not have a large number of sequence homologs. Analogous to ab initio folding that usually predicts a few possible 3D models for a given protein sequence, this paper presents a novel structure learning method that can predict a set of diverse contact maps for a given protein sequence, in which the best solution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsProtein Structure and Dynamics · Microbial Metabolic Engineering and Bioproduction · Bioinformatics and Genomic Networks