Constraints on the perfect phylogeny mixture model and their effect on reducing degeneracy
John Marangola, Azadeh Sheikholeslami, Jos\'e Bento

TL;DR
This paper investigates the limitations of the perfect phylogeny mixture model in resolving evolutionary tree ambiguity and introduces new constraints to reduce degeneracy, supported by theoretical analysis over large problem ensembles.
Contribution
It demonstrates the ineffectiveness of existing longitudinal constraints and proposes novel constraints that better limit solution ambiguity in the PPM model.
Findings
Longitudinal constraints often fail to reduce tree ambiguity.
New constraints can significantly decrease the number of plausible phylogenetic trees.
Theoretical results hold across large ensembles of inference problems.
Abstract
The perfect phylogeny mixture (PPM) model is useful due to its simplicity and applicability in scenarios where mutations can be assumed to accumulate monotonically over time. It is the underlying model in many tools that have been used, for example, to infer phylogenetic trees for tumor evolution and reconstruction. Unfortunately, the PPM model gives rise to substantial ambiguity -- in that many different phylogenetic trees can explain the same observed data -- even in the idealized setting where data are observed perfectly, i.e. fully and without noise. This ambiguity has been studied in this perfect setting by Pradhan et al. 2018, which proposed a procedure to bound the number of solutions given a fixed instance of observation data. Beyond this, studies have been primarily empirical. Recent work (Myers et al. 2019) proposed adding extra constraints to the PPM model to tackle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenome Rearrangement Algorithms · Cancer Genomics and Diagnostics · Genomics and Phylogenetic Studies
