Spherical Distance Metrics Applied to Protein Structure Classification
James DeFelice, Vicente M. Reyes

TL;DR
This paper introduces new spherical distance metrics applied to protein structures using the DCRR model, enhancing structural similarity analysis and clustering efficiency in protein classification tasks.
Contribution
It combines DCRR with spherical distance metrics and M-tree indexing to improve protein structure similarity queries and clustering performance over prior methods.
Findings
Improved kNN search accuracy and speed.
Enhanced clustering of protein structures.
Better structural similarity measurement using DCRR and spherical metrics.
Abstract
Structural relationships among proteins are important in the study of their evolution as well as in drug design and development. The protein 3D structure has been shown to be effective with respect to classifying proteins. Prior work has shown that the Double Centroid Reduced Representation (DCRR) model is a useful geometric representation for protein structure with respect to visual models, reducing the quantity of modeled information for each amino acid, yet retaining the most important geometrical and chemical features of each: the centroids of the backbone and of the side-chain. DCRR has not yet been applied in the calculation of geometric structural similarity. Meanwhile, multi-dimensional indexing (MDI) of protein structure combines protein structural analysis with distance metrics to facilitate structural similarity queries and is also used for clustering protein structures into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · Glycosylation and Glycoproteins Research
