Similarity search for local protein structures at atomic resolution by exploiting a database management system
Akira R. Kinjo, Haruki Nakamura

TL;DR
This paper introduces a database-driven method for rapid local protein structure similarity search at atomic resolution, enabling efficient identification of ligand binding site similarities in large datasets.
Contribution
The paper presents a novel geometric indexing approach using relational databases for fast, large-scale local protein structure comparison at atomic detail.
Findings
Able to handle over 160,000 candidates within hours on a desktop
Successfully detects significant structural similarities in non-homologous proteins
Constructs atomic-resolution alignments using the Hungarian algorithm
Abstract
A method to search for local structural similarities in proteins at atomic resolution is presented. It is demonstrated that a huge amount of structural data can be handled within a reasonable CPU time by using a conventional relational database management system with appropriate indexing of geometric data. This method, which we call geometric indexing, can enumerate ligand binding sites that are structurally similar to sub-structures of a query protein among more than 160,000 possible candidates within a few hours of CPU time on an ordinary desktop computer. After detecting a set of high scoring ligand binding sites by the geometric indexing search, structural alignments at atomic resolution are constructed by iteratively applying the Hungarian algorithm, and the statistical significance of the final score is estimated from an empirical model based on a gamma distribution. Applications…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
