Determining distances and consensus between mutation trees
Lu\'is Cunha, Jack Kuipers, Thiago Lopes

TL;DR
This paper introduces methods to measure distances between mutation trees, addresses the computational complexity of finding consensus trees, and proposes algorithms for efficient approximation and improved consensus solutions.
Contribution
It presents an efficient approach to compute distances between mutation trees, analyzes the NP-completeness of median and closest tree problems, and develops algorithms for consensus tree estimation.
Findings
Proposed a distance measure based on swap operations for trees.
Proved median and closest tree problems are NP-complete.
Developed algorithms that produce better consensus trees than input sets.
Abstract
The mutational heterogeneity of tumours can be described with a tree representing the evolutionary history of the tumour. With noisy sequencing data there may be uncertainty in the inferred tree structure, while we may also wish to study patterns in the evolution of cancers in different patients. In such situations, understanding tree similarities is a key challenge, and therefore we present an approach to determine distances between trees. Considering the bounded height of trees, we determine the distances associated with the swap operations over strings. While in general, by solving the {\sc Maximum Common Almost -tree} problem between two trees, we describe an efficient approach to determine the minimum number of operations to transform one tree into another. The inherent noise in current statistical methods for constructing mutation evolution trees of cancer cells presents a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
