A Space-Efficient Approach towards Distantly Homologous Protein   Similarity Searches

Akash Nag; Sunil Karforma

arXiv:1508.06561·cs.CE·August 27, 2015

A Space-Efficient Approach towards Distantly Homologous Protein Similarity Searches

Akash Nag, Sunil Karforma

PDF

Open Access

TL;DR

This paper introduces a space-efficient heuristic algorithm for protein similarity searches that balances speed, sensitivity, and low memory usage, especially effective for moderately sized databases and short queries.

Contribution

It presents a novel heuristic pair-wise sequence alignment method with constant space complexity, improving efficiency and sensitivity in distantly homologous protein searches.

Findings

01

Fast and space-efficient for moderate databases

02

Capable of detecting distantly related proteins

03

Produces high-quality alignments

Abstract

Protein similarity searches are a routine job for molecular biologists where a query sequence of amino acids needs to be compared and ranked against an ever-growing database of proteins. All available algorithms in this field can be grouped into two categories, either solving the problem using sequence alignment through dynamic programming, or, employing certain heuristic measures to perform an initial screening followed by applying an optimal sequence alignment algorithm to the closest matching candidates. While the first approach suffers from huge time and space demands, the latter approach might miss some protein sequences which are distantly related to the query sequence. In this paper, we propose a heuristic pair-wise sequence alignment algorithm that can be efficiently employed for protein database searches for moderately sized databases. The proposed algorithm is sufficiently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenomics and Phylogenetic Studies · Advanced Proteomics Techniques and Applications · Algorithms and Data Compression