The SCJ small parsimony problem for weighted gene adjacencies (Extended version)
Nina Luhmann, Manuel Lafond, Annelyse Th\'evenin, A\"ida Ouangraoua,, Roland Wittler, Cedric Chauve

TL;DR
This paper introduces an exact algorithm for reconstructing ancestral gene orders by combining weighted adjacency data with the SCJ model, improving accuracy and reducing fragmentation in phylogenetic analyses.
Contribution
It presents a novel fixed-parameter tractable algorithm for the weighted small parsimony problem in ancestral genome reconstruction, incorporating adjacency weights from ancient DNA and probabilistic data.
Findings
Including adjacency weights reduces genome fragmentation.
The problem is NP-hard, but solvable with the proposed FPT algorithm.
Application to mammalian and bacterial data demonstrates effectiveness.
Abstract
Reconstructing ancestral gene orders in a given phylogeny is a classical problem in comparative genomics. Most existing methods compare conserved features in extant genomes in the phylogeny to define potential ancestral gene adjacencies, and either try to reconstruct all ancestral genomes under a global evolutionary parsimony criterion, or, focusing on a single ancestral genome, use a scaffolding approach to select a subset of ancestral gene adjacencies, generally aiming at reducing the fragmentation of the reconstructed ancestral genome. In this paper, we describe an exact algorithm for the Small Parsimony Problem that combines both approaches. We consider that gene adjacencies at internal nodes of the species phylogeny are weighted, and we introduce an objective function defined as a convex combination of these weights and the evolutionary cost under the Single-Cut-or-Join (SCJ)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenome Rearrangement Algorithms · Algorithms and Data Compression · DNA and Biological Computing
