Natural family-free genomic distance
Diego P. Rubert, F\'abio V. Martinez, Mar\'ilia D. V. Braga

TL;DR
This paper introduces a new, more general family-free genomic distance measure using the DCJ model, incorporating insertions and deletions, with an efficient ILP solution demonstrated on diverse genomes.
Contribution
It proposes a novel, comprehensive family-free genomic distance measure with an ILP formulation, extending previous models to include insertions and deletions.
Findings
ILP produces accurate results
Handles diverse genomes including bacteria, fungi, insects, and parts of mammals and plants
Extends family-free distance computation to more complex rearrangements
Abstract
A classical problem in comparative genomics is to compute the rearrangement distance, that is the minimum number of large-scale rearrangements required to transform a given genome into another given genome. While the most traditional approaches in this area are family-based, i.e., require the classification of DNA fragments into families, more recently an alternative family-free approach was proposed, and consists of studying the rearrangement distances without prior family assignment. On the one hand the computation of genomic distances in the family-free setting helps to match occurrences of duplicated genes and find homologies, but on the other hand this computation is NP-hard. In this paper, by letting structural rearrangements be represented by the generic double cut and join (DCJ) operation and also allowing insertions and deletions of DNA segments, we propose a new and more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
