Estimating transmission from genetic and epidemiological data: a metric to compare transmission trees
Michelle Kendall, Diepreye Ayabina, Caroline Colijn

TL;DR
This paper introduces a new metric for comparing transmission trees in epidemiological studies, aiding in the analysis of Bayesian inference results from genetic data, and improving the assessment of transmission dynamics.
Contribution
It develops a novel metric for quantifying differences between transmission trees, accommodating unsampled individuals and highlighting key transmission features.
Findings
The metric reveals sensitivity to priors in posterior trees.
It identifies distinct clusters within collections of transmission trees.
The metric enables summarizing clusters with median trees.
Abstract
Reconstructing who infected whom is a central challenge in analysing epidemiological data. Recently, advances in sequencing technology have led to increasing interest in Bayesian approaches to inferring who infected whom using genetic data from pathogens. The logic behind such approaches is that isolates that are nearly genetically identical are more likely to have been recently transmitted than those that are very different. A number of methods have been developed to perform this inference. However, testing their convergence, examining posterior sets of transmission trees and comparing methods' performance are challenged by the fact that the object of inference - the transmission tree - is a complicated discrete structure. We introduce a metric on transmission trees to quantify distances between them. The metric can accommodate trees with unsampled individuals, and highlights…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
