Phylogenetic trees and Euclidean embeddings

Mark Layer; John A. Rhodes

arXiv:1605.01039·q-bio.PE·May 4, 2016·1 cites

Phylogenetic trees and Euclidean embeddings

Mark Layer, John A. Rhodes

PDF

Open Access

TL;DR

This paper explains why a square root transformation of phylogenetic distances allows taxa to be embedded in Euclidean space, offering new insights into tree structures and algorithms like NJ and BIONJ.

Contribution

It provides a direct, elementary explanation for the Euclidean embedding of phylogenetic trees, enhancing understanding of tree structures and related algorithms.

Findings

01

Square root transformation enables Euclidean embedding of taxa.

02

The embedding offers new insights into tree-building algorithms.

03

Reinterprets differences between NJ and BIONJ methods.

Abstract

It was recently observed by de Vienne et al. that a simple square root transformation of distances between taxa on a phylogenetic tree allowed for an embedding of the taxa into Euclidean space. While the justification for this was based on a diffusion model of continuous character evolution along the tree, here we give a direct and elementary explanation for it that provides substantial additional insight. We use this embedding to reinterpret the differences between the NJ and BIONJ tree building algorithms, providing one illustration of how this embedding reflects tree structures in data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolution and Paleontology Studies · Genomics and Phylogenetic Studies · Genetic diversity and population structure