# Annotation of 200 Insect Genomes with BRAKER for Consistent Comparisons across Species

**Authors:** Stepan Saenko, Katharina J. Hoff, Mario Stanke

PMC · DOI: 10.1038/s41597-026-06840-0 · 2026-02-19

## TL;DR

This paper describes the annotation of 200 insect genomes using BRAKER3 to enable consistent comparisons across species.

## Contribution

An automated genome annotation workflow based on BRAKER3, applied to 200 insect species, 85 of which previously lacked annotations in GenBank.

## Key findings

- Annotations were generated for 85 insect species previously lacking annotations in GenBank.
- The workflow produced gene structures, protein sequences, and orthologous gene groups for comparative analysis.
- A species tree and gene ontology terms were derived from the annotations.

## Abstract

The annotation of genomes progresses more slowly than their sequencing and assembly. In addition, previously annotated species can benefit from reannotation using more recent RNA-Seq and protein data, as well as from state-of-the-art annotation methods with improved accuracy. Heterogeneous annotations performed with different tools and protein databases can introduce artifactual differences when comparing gene sets or gene structures between species. The recently introduced BRAKER3 annotation pipeline integrates evidence from RNA-Seq and a protein database. Here, we introduce an automated genome annotation workflow based on BRAKER3 that allows one to annotate a list of species with minimal manual intervention. We selected a diverse set of 200 insect species from different families, including 85 species previously lacking annotations in GenBank. Using currently available RNA-Seq and protein sequence data, we applied our automated workflow to annotate these genomes and conducted downstream analyses typically performed in comparative genomics studies. We present the resulting gene structures, protein sequences, gene ontology terms, orthologous gene groups and a species tree.
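To make the idea of annotating a list of species with one uniform configuration more concrete, the following is a minimal sketch and not the authors' workflow. It only builds one `braker.pl` command line per species, using the same protein database for all of them, and does not execute anything. The `SpeciesInput` record, the helper names, the file paths, and the thread count are all hypothetical; the paper's actual workflow (for example, how RNA-Seq data are retrieved and aligned) may differ substantially.

```python
import shlex
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class SpeciesInput:
    """Hypothetical per-species record; field names are illustrative only."""
    name: str         # species label, e.g. "Bombyx_mori"
    genome: Path      # genome assembly in FASTA format
    rnaseq_bam: Path  # RNA-Seq reads already aligned to that genome (BAM)


def braker_command(sp, proteins, out_root, threads=8):
    """Build (but do not run) a braker.pl call with RNA-Seq and protein evidence."""
    workdir = Path(out_root) / sp.name
    return [
        "braker.pl",
        f"--genome={sp.genome}",
        f"--prot_seq={proteins}",   # same protein database for every species
        f"--bam={sp.rnaseq_bam}",
        f"--species={sp.name}",
        f"--workingdir={workdir}",
        f"--threads={threads}",
    ]


def plan_runs(species, proteins, out_root):
    """Map each species name to its command, keeping settings identical across species."""
    return {sp.name: braker_command(sp, proteins, out_root) for sp in species}


if __name__ == "__main__":
    # Assumed example inputs; these paths do not come from the paper.
    species = [
        SpeciesInput("Bombyx_mori", Path("genomes/Bombyx_mori.fa"), Path("rnaseq/Bombyx_mori.bam")),
        SpeciesInput("Apis_mellifera", Path("genomes/Apis_mellifera.fa"), Path("rnaseq/Apis_mellifera.bam")),
    ]
    for name, cmd in plan_runs(species, Path("proteins.fa"), Path("annotations")).items():
        print(shlex.join(cmd))
```

Keeping the protein database and options fixed in one place mirrors the motivation stated in the abstract: differences between gene sets should then reflect biology rather than differences in tools or evidence sources.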

## Full-text entities

- **Chemicals:** Selenocysteine (MESH:D017279), Arg (MESH:D001120), Ser (MESH:D012694), raxml (-), Gly (MESH:D005998), Leu (MESH:D007930), Nucleotide (MESH:D009711), Ala (MESH:D000409)
- **Species:** Diptera (flies, order) [taxon 7147], Bombyx mori (domestic silkworm, species) [taxon 7091], Gallus gallus (bantam, species) [taxon 9031], Drosophila melanogaster (fruit fly, species) [taxon 7227], Apis mellifera (bee, species) [taxon 7460], Medicago truncatula (barrel medic, species) [taxon 3880], Ditrysia (clade) [taxon 37567], Polyneoptera (cohort) [taxon 33341], Homo sapiens (human, species) [taxon 9606], Bombus terrestris (buff-tailed bumblebee, species) [taxon 30195], Pterygota (genus) [taxon 300552], Solanum lycopersicum (tomato, species) [taxon 4081], Danio rerio (leopard danio, species) [taxon 7955], Populus trichocarpa (black cottonwood, species) [taxon 3694], Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Propsilocerus akamusi (species) [taxon 903466], Mus musculus (house mouse, species) [taxon 10090], Parasteatoda tepidariorum (common house spider, species) [taxon 114398], Caenorhabditis elegans (species) [taxon 6239], Endopterygota (cohort) [taxon 33392], Periplaneta americana (American cockroach, species) [taxon 6978], Paraneoptera (cohort) [taxon 33342], Goniozus legneri (species) [taxon 863708], Prunus persica (peach, species) [taxon 3760], Polyphaga (suborder) [taxon 41084]
- **Mutations:** T2T

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12923644/full.md

---
Source: https://tomesphere.com/paper/PMC12923644