Relaxed Agreement Forests

Virginia Aardevol Martinez; Steven Chaplick; Steven Kelk; Ruben; Meuwese; Matus Mihalak; Georgios Stamoulis

arXiv:2309.01110·cs.DS·September 6, 2023

Relaxed Agreement Forests

Virginia Aardevol Martinez, Steven Chaplick, Steven Kelk, Ruben, Meuwese, Matus Mihalak, Georgios Stamoulis

PDF

Open Access 1 Repo

TL;DR

This paper introduces the maximum relaxed agreement forest (MRAF) problem for comparing unrooted binary phylogenetic trees, providing complexity results, approximation algorithms, and special case polynomial-time solutions.

Contribution

It defines MRAF as a new measure allowing overlapping subtrees, proves NP-hardness, offers an O(log n)-approximation, and explores fixed-parameter tractability for caterpillar trees.

Findings

01

MRAF is NP-hard to compute.

02

An O(log n)-approximation algorithm exists for MRAF.

03

Polynomial-time testing for fixed k when at least one tree is a caterpillar.

Abstract

There are multiple factors which can cause the phylogenetic inference process to produce two or more conflicting hypotheses of the evolutionary history of a set X of biological entities. That is: phylogenetic trees with the same set of leaf labels X but with distinct topologies. This leads naturally to the goal of quantifying the difference between two such trees T_1 and T_2. Here we introduce the problem of computing a 'maximum relaxed agreement forest' (MRAF) and use this as a proxy for the dissimilarity of T_1 and T_2, which in this article we assume to be unrooted binary phylogenetic trees. MRAF asks for a partition of the leaf labels X into a minimum number of blocks S_1, S_2, ... S_k such that for each i, the subtrees induced in T_1 and T_2 by S_i are isomorphic up to suppression of degree-2 nodes and taking the labels X into account. Unlike the earlier introduced maximum…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

skelk2001/relaxed_agreement_forests
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications · Semantic Web and Ontologies · Bayesian Modeling and Causal Inference