Who Watches the Watchmen? An Appraisal of Benchmarks for Multiple   Sequence Alignment

Stefano Iantorno; Kevin Gori; Nick Goldman; Manuel Gil; and Christophe; Dessimoz

arXiv:1211.2160·q-bio.QM·January 9, 2015

Who Watches the Watchmen? An Appraisal of Benchmarks for Multiple Sequence Alignment

Stefano Iantorno, Kevin Gori, Nick Goldman, Manuel Gil, and Christophe, Dessimoz

PDF

TL;DR

This paper reviews various benchmarking strategies for multiple sequence alignment, highlighting their strengths and limitations, and emphasizes the importance of choosing appropriate benchmarks based on specific application contexts.

Contribution

It provides a comprehensive overview of existing benchmarking methods for MSA, evaluates their effectiveness, and discusses the need for context-dependent benchmark selection.

Findings

01

No universal benchmark for MSA exists

02

Different benchmarks have distinct advantages and risks

03

Context-specific benchmark choice is recommended

Abstract

Multiple sequence alignment (MSA) is a fundamental and ubiquitous technique in bioinformatics used to infer related residues among biological sequences. Thus alignment accuracy is crucial to a vast range of analyses, often in ways difficult to assess in those analyses. To compare the performance of different aligners and help detect systematic errors in alignments, a number of benchmarking strategies have been pursued. Here we present an overview of the main strategies--based on simulation, consistency, protein structure, and phylogeny--and discuss their different advantages and associated risks. We outline a set of desirable characteristics for effective benchmarking, and evaluate each strategy in light of them. We conclude that there is currently no universally applicable means of benchmarking MSA, and that developers and users of alignment tools should base their choice of benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.