Mathematical aspects of phylogenetic groves
Mareike Fischer

TL;DR
This paper investigates the mathematical properties of phylogenetic groves, a concept used to identify informative species groups for constructing supertrees, and introduces 2-overlap groves to address limitations in the original conjecture.
Contribution
It disproves the conjecture that maximal groves can be easily identified and proposes 2-overlap groves as a new, more effective concept.
Findings
Maximal groves cannot always be identified easily.
Introduction of 2-overlap groves to improve identification.
Disproof of the original conjecture.
Abstract
The inference of new information on the relatedness of species by phylogenetic trees based on DNA data is one of the main challenges of modern biology. But despite all technological advances, DNA sequencing is still a time-consuming and costly process. Therefore, decision criteria would be desirable to decide a priori which data might contribute new information to the supertree which is not explicitly displayed by any input tree. A new concept, so-called groves, to identify taxon sets with the potential to construct such informative supertrees was suggested by An\'e et al. in 2009. But the important conjecture that maximal groves can easily be identified in a database remained unproved and was published on the Isaac Newton Institute's list of open phylogenetic problems. In this paper, we show that the conjecture does not generally hold, but also introduce a new concept, namely 2-overlap…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
