An Improved Search Algorithm for Optimal Multiple-Sequence Alignment
S. Schroedl

TL;DR
This paper introduces a novel search algorithm for optimal multiple sequence alignment that combines heuristic pruning with reduced memory usage, outperforming existing methods in efficiency and scalability.
Contribution
It presents an innovative algorithm that merges heuristic search with dynamic programming principles to improve efficiency and memory management in optimal MSA computation.
Findings
Outperforms Partial Expansion A* in time and memory.
Reduces search space to O(kN^(k-1)) similar to dynamic programming.
Achieves practical running times below four times that of A*.
Abstract
Multiple sequence alignment (MSA) is a ubiquitous problem in computational biology. Although it is NP-hard to find an optimal solution for an arbitrary number of sequences, due to the importance of this problem researchers are trying to push the limits of exact algorithms further. Since MSA can be cast as a classical path finding problem, it is attracting a growing number of AI researchers interested in heuristic search algorithms as a challenge with actual practical relevance. In this paper, we first review two previous, complementary lines of research. Based on Hirschbergs algorithm, Dynamic Programming needs O(kN^(k-1)) space to store both the search frontier and the nodes needed to reconstruct the solution path, for k sequences of length N. Best first search, on the other hand, has the advantage of bounding the search space that has to be explored using a heuristic. However, it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
