On the Complexity of Optimising Variants of Phylogenetic Diversity on Phylogenetic Networks
Magnus Bordewich, Charles Semple, Kristina Wicke

TL;DR
This paper investigates the computational complexity of optimizing various variants of Phylogenetic Diversity scores on different classes of rooted phylogenetic networks, extending the classical tree-based measure to more complex network structures.
Contribution
It introduces several variants of PD for taxa related by phylogenetic networks and analyzes their computational complexity for subset selection problems.
Findings
Complexity results vary across network classes.
Some variants are NP-hard to optimize.
Certain network classes admit polynomial-time solutions.
Abstract
Phylogenetic Diversity (PD) is a prominent quantitative measure of the biodiversity of a collection of present-day species (taxa). This measure is based on the evolutionary distance among the species in the collection. Loosely speaking, if is a rooted phylogenetic tree whose leaf set represents a set of species and whose edges have real-valued lengths (weights), then the PD score of a subset of is the sum of the weights of the edges of the minimal subtree of connecting the species in . In this paper, we define several natural variants of the PD score for a subset of taxa which are related by a known rooted phylogenetic network. Under these variants, we explore, for a positive integer , the computational complexity of determining the maximum PD score over all subsets of taxa of size when the input is restricted to different classes of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPlant and animal studies · Genomics and Phylogenetic Studies · Mycorrhizal Fungi and Plant Interactions
