# OMAnnotator: a novel approach to building an annotated consensus genome sequence

**Authors:** Sadé Bates, Christophe Dessimoz, Yannis Nevers

PMC · DOI: 10.1093/bioadv/vbag015 · Bioinformatics Advances · 2026-01-22

## TL;DR

OMAnnotator is a new tool that improves genome annotation by combining multiple gene predictions using evolutionary relationships.

## Contribution

OMAnnotator repurposes the OMA algorithm to create more accurate consensus annotations by resolving conflicts between gene prediction methods.

## Key findings

- OMAnnotator outperformed existing annotation combiners on the Drosophila melanogaster reference genome.
- It showed substantial improvements in two newly annotated genomes but mixed results in a third with expert curation.
- The method effectively resolves disagreements among gene prediction tools using evolutionary information.

## Abstract

Advances in sequencing technologies have enabled researchers to sequence whole genomes rapidly and cheaply. However, despite improvements in genome assembly, structural genome annotation (i.e. the identification of protein-coding genes) remains challenging, particularly for eukaryotic genomes. It requires using several approaches (typically ab initio, transcriptomics, and homology search), which may give substantially different results. Deciding which gene models to retain in a consensus is far from trivial, and automated approaches tend to lag behind laborious manual curation efforts in accuracy.

We present OMAnnotator, a novel approach to building a consensus annotation. OMAnnotator repurposes the OMA algorithm, originally designed to elucidate evolutionary relationships among genes across species, to integrate predictions from different annotation sources into a consensus annotation, using evolutionary information as a tie-breaker. During benchmarking on the Drosophila melanogaster reference, OMAnnotator’s consensus improved upon its source annotations and two state-of-the-art pipelines used as annotation combiners with the same inputs. When applied to three recently published genomes, OMAnnotator gave substantial improvements in two cases, and mixed results in the third, which had already benefitted from extensive expert curation. This underlines the method’s effectiveness and robustness for combining the results of disagreeing annotation softwares, strengthening the toolkit for eukaryotic genome annotation.

OMAnnotator is available on GitHub (https://github.com/DessimozLab/OMAnnotator).

## Linked entities

- **Species:** Drosophila melanogaster (taxon 7227)

## Full-text entities

- **Diseases:** P. discolor (MESH:D014075), HPC (MESH:C537243), SLIDINGWINDOW:4:15 (MESH:D012559)
- **Chemicals:** NO (MESH:D009614), OMA (-)
- **Species:** Drosophila melanogaster (fruit fly, species) [taxon 7227], Anopheles gambiae (African malaria mosquito, species) [taxon 7165], Homo sapiens (human, species) [taxon 9606], Siraitia grosvenorii (arhat fruit, species) [taxon 190515], Melanogaster (genus) [taxon 80614], Phyllostomus discolor (pale spear-nosed bat, species) [taxon 89673], Olea europaea (common olive, species) [taxon 4146]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12927413/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12927413/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12927413/full.md

---
Source: https://tomesphere.com/paper/PMC12927413