Distance-based phylogenetic inference from typing data: a unifying view

C\'atia Vaz; Marta Nascimento; Jo\~ao A. Carri\c{c}o; Tatiana; Rocher; Alexandre P. Francisco

arXiv:2006.07016·cs.DS·June 16, 2020

Distance-based phylogenetic inference from typing data: a unifying view

C\'atia Vaz, Marta Nascimento, Jo\~ao A. Carri\c{c}o, Tatiana, Rocher, Alexandre P. Francisco

PDF

TL;DR

This paper reviews and unifies distance-based phylogenetic inference algorithms from typing data, highlighting their common principles, computational challenges, and potential improvements for large-scale infectious disease analysis.

Contribution

It provides a comprehensive unifying framework for distance-based phylogenetic algorithms, clarifying their differences and identifying computational bottlenecks.

Findings

01

Unified view of distance-based phylogenetic algorithms

02

Identification of key computational bottlenecks

03

Insights for improving scalability with large data sets

Abstract

Typing methods are widely used in the surveillance of infectious diseases, outbreaks investigation and studies of the natural history of an infection. And their use is becoming standard, in particular with the introduction of High Throughput Sequencing (HTS). On the other hand, the data being generated is massive and many algorithms have been proposed for phylogenetic analysis of typing data, addressing both correctness and scalability issues. Most of the distance-based algorithms for inferring phylogenetic trees follow the closest-pair joining scheme. This is one of the approaches used in hierarchical clustering. And although phylogenetic inference algorithms may seem rather different, the main difference among them resides on how one defines cluster proximity and on which optimization criterion is used. Both cluster proximity and optimization criteria rely often on a model of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.