Towards Reliable Automatic Protein Structure Alignment
Xuefeng Cui, Shuai Cheng Li, Dongbo Bu, and Ming Li

TL;DR
This paper introduces a global information-based method for protein structure alignment that significantly improves accuracy over existing tools by incorporating multiple similarity measures, including sequence similarity.
Contribution
The work presents a novel global alignment approach that outperforms state-of-the-art methods and integrates sequence similarity into structure alignment scoring.
Findings
Significantly better TM-score and GDT score results compared to existing tools.
Reduces failure probability in fold detection from 42% to 2%.
Incorporates sequence similarity to improve alignment quality.
Abstract
A variety of methods have been proposed for structure similarity calculation, which are called structure alignment or superposition. One major shortcoming in current structure alignment algorithms is in their inherent design, which is based on local structure similarity. In this work, we propose a method to incorporate global information in obtaining optimal alignments and superpositions. Our method, when applied to optimizing the TM-score and the GDT score, produces significantly better results than current state-of-the-art protein structure alignment tools. Specifically, if the highest TM-score found by TMalign is lower than (0.6) and the highest TM-score found by one of the tested methods is higher than (0.5), there is a probability of (42%) that TMalign failed to find TM-scores higher than (0.5), while the same probability is reduced to (2%) if our method is used. This could…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Genomics and Phylogenetic Studies · Protein Structure and Dynamics
