Lessons learned from manual curation of thousands of gene models in the nematode Pristionchus pacificus
Christian Rödelsperger, Neha Agyal, Shiela Pearl Quiobe, Hao Wu, Waltraud Röseler, Dafne Ibarra-Morales, Ralf J. Sommer

TL;DR
This paper discusses how gene annotations in the nematode Pristionchus pacificus were improved by manually curating over 7,500 gene models and identifying common annotation errors.
Contribution
The study introduces a manual curation workflow that improves gene annotations and highlights frequent annotation errors across species.
Findings
Manual curation corrected over 7,500 (~24%) gene models in Pristionchus pacificus.
Common annotation errors include assembly errors, artificial transcript fusions, and false open reading frames.
The findings can guide genome annotation efforts in other species.
Abstract
Continuous developments in sequencing technologies have led to the generation of chromosome-scale genome assemblies across the whole tree of life, but our ability to annotate genomes has lacked behind. One major problem consists in the fact that typically not all genes are expressed at detectable levels at any given life stage or environment. Therefore, available transcriptome data needs to be complemented by gene prediction programs and protein homology evidence. However, how to optimally combine these different data types is not well understood. Here, we present a case study, where we community curated gene annotations of the Pristionchus pacificus strain RSC011. By incorporation of new Iso-seq and RNA-seq data and genome-wide screening, we identified and corrected more than 7,500 (~24%) gene models. While the improved gene annotation for the RSC011 strain will be useful for the P.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNematode management and characterization studies · Genetics, Aging, and Longevity in Model Organisms · Entomopathogenic Microorganisms in Pest Control
