Alignment Metric Accuracy
Ariel S. Schwartz, Eugene W. Myers, Lior Pachter

TL;DR
This paper introduces a new alignment accuracy metric and an algorithm called AMAP that improves the assessment and quality of multiple sequence alignments, especially in identifying unalignable regions and balancing sensitivity and specificity.
Contribution
The paper presents a novel accuracy metric for sequence alignments, an algorithm for maximizing expected accuracy, and demonstrates superior performance of AMAP on benchmark datasets.
Findings
The new metric improves accuracy assessment when a reference alignment is available.
The method effectively identifies and discards low-accuracy, unalignable regions.
AMAP outperforms existing multiple alignment programs on benchmarks.
Abstract
We propose a metric for the space of multiple sequence alignments that can be used to compare two alignments to each other. In the case where one of the alignments is a reference alignment, the resulting accuracy measure improves upon previous approaches, and provides a balanced assessment of the fidelity of both matches and gaps. Furthermore, in the case where a reference alignment is not available, we provide empirical evidence that the distance from an alignment produced by one program to predicted alignments from other programs can be used as a control for multiple alignment experiments. In particular, we show that low accuracy alignments can be effectively identified and discarded. We also show that in the case of pairwise sequence alignment, it is possible to find an alignment that maximizes the expected value of our accuracy measure. Unlike previous approaches based on expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Scientific Computing and Data Management · Algorithms and Data Compression
