A topological approach for protein classification
Zixuan Cang, Lin Mu, Kedi Wu, Kristopher Opron, Kelin Xia, Guo-Wei, Wei

TL;DR
This paper introduces a novel topological data analysis method using persistent homology for protein classification, achieving high accuracy across various biological datasets and demonstrating the effectiveness of topological invariants as features.
Contribution
It proposes a molecular topological fingerprint support vector machine (MTF-SVM) that uses topological invariants for protein classification, providing a new independent approach.
Findings
96% accuracy in discriminating drug-bound and unbound M2 channels
Approximately 80% accuracy in classifying hemoglobin forms
85% success rate in identifying protein domains
Abstract
Protein function and dynamics are closely related to its sequence and structure. However prediction of protein function and dynamics from its sequence and structure is still a fundamental challenge in molecular biology. Protein classification, which is typically done through measuring the similarity be- tween proteins based on protein sequence or physical information, serves as a crucial step toward the understanding of protein function and dynamics. Persistent homology is a new branch of algebraic topology that has found its success in the topological data analysis in a variety of disciplines, including molecular biology. The present work explores the potential of using persistent homology as an indepen- dent tool for protein classification. To this end, we propose a molecular topological fingerprint based support vector machine (MTF-SVM) classifier. Specifically, we construct machine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Alzheimer's disease research and treatments · Machine Learning in Bioinformatics
