Indirect Identification of Horizontal Gene Transfer
David Schaller, Manuel Lafond, Peter F. Stadler, Nicolas Wieseke and, Marc Hellmuth

TL;DR
This paper characterizes LDT graphs used to infer horizontal gene transfer, providing polynomial-time recognition algorithms and demonstrating how to extract HGT information from these graphs through simulations and graph editing.
Contribution
It introduces a graph-theoretic framework for identifying HGT events, including recognition algorithms and methods to infer HGT from LDT graphs.
Findings
LDT graphs are a subclass of properly vertex-colored cographs.
Recognition algorithms for LDT and rs-Fitch graphs operate in polynomial time.
A greedy graph editing scheme can effectively detect HGT events from LDT graphs.
Abstract
Several implicit methods to infer Horizontal Gene Transfer (HGT) focus on pairs of genes that have diverged only after the divergence of the two species in which the genes reside. This situation defines the edge set of a graph, the later-divergence-time (LDT) graph, whose vertices correspond to genes colored by their species. We investigate these graphs in the setting of relaxed scenarios, i.e., evolutionary scenarios that encompass all commonly used variants of duplication-transfer-loss scenarios in the literature. We characterize LDT graphs as a subclass of properly vertex-colored cographs, and provide a polynomial-time recognition algorithm as well as an algorithm to construct a relaxed scenario that explains a given LDT. An edge in an LDT graph implies that the two corresponding genes are separated by at least one HGT event. The converse is not true, however. We show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
