A novel entropy-based hierarchical clustering framework for ultrafast protein structure search and alignment
Bar{\i}\c{s} Ekim

TL;DR
This paper introduces a new entropy-based hierarchical clustering framework for ultrafast protein structure search and alignment, significantly reducing computational time while maintaining accuracy, and providing a web-based tool for proteomics research.
Contribution
The paper presents a novel entropy-based hierarchical clustering method for protein structure comparison that improves speed and scalability over existing approaches.
Findings
Replicates gold standard results with minimal sensitivity loss
Achieves significantly shorter search times
Provides a dynamic web-based protein comparison environment
Abstract
Identification and alignment of three-dimensional folding of proteins may yield useful information about relationships too remote to be detected by conventional methods, such as sequence comparison, and may potentially lead to prediction of patterns and motifs in mutual structural fragments. With the exponential increase of structural proteomics data, the methods that scale with the rate of increase of data lose efficiency. Hence, new methods that reduce the computational expense of this problem should be developed. We present a novel framework through which we are able to find and align protein structure neighbors via hierarchical clustering and entropy-based query search, and present a web-based protein database search and alignment tool to demonstrate the applicability of our approach. The resulting method replicates the results of the current gold standard with a minimal loss in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Genomics and Phylogenetic Studies · Advanced Proteomics Techniques and Applications
