ONT read assembly of the black rhino genome
Ken Kraaijeveld, Koen Bossers, Nikola Petrusevski, Stef Pieterman, Linda G.R. Bruins-van Sonsbeek, Floyd Wittink

TL;DR
Researchers assembled the genome of a black rhino using long-read sequencing to support conservation efforts and health studies in this endangered species.
Contribution
A high-quality draft genome of the black rhino assembled using only ONT long-read sequencing data.
Findings
The genome assembly consists of 2.47 Gb across 834 contigs with an N50 of 29.53 Mb.
Over 99% of gene features were retrieved by lifting over an existing annotation.
The assembly will aid conservation genetics and health research in black rhinos.
Abstract
The black rhinoceros (Diceros bicornis) is an endangered mammal for which a captive breeding program is part of the conservation effort. Black rhinos in zoo’s often suffer from chronic infections and heamochromatosis. Furthermore, breeding is hampered by low male fertility. To aid a research project studying these topics, we sequenced and assembled the genome of a captive male black rhino using ONT sequencing data only. This work produced over 100 Gb whole genome sequencing reads from whole blood. These were assembled into a 2.47 Gb draft genome consisting of 834 contigs with an N50 of 29.53 Mb. The genome annotation was lifted over from an available genome annotation for black rhino, which resulted in the retrieval of over 99% of gene features. This new genome assembly will be a valuable resource in for conservation genetic research in this species. Black rhinoceros, Diceros…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —by SIA Raak Publiek
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Identification and Quantification in Food · Molecular Biology Techniques and Applications
Objective
Ongoing habitat loss, overexploitation, climate change and other factors have severely reduced population size of many large mammals. In addition to preventing and reversing these causal factors of biodiversity loss, ex-situ captive breeding programs can be an effective method for boosting small populations. Captive breeding programs are being attempted to augment the wild population of several species and prevent their extinction [1, 2].
Genomic resources play an increasingly important role in conservation-related captive breeding projects [2]. For example, measurement of genetic diversity and relatedness between individuals may inform choice of breeding individuals [1]. Genomic resources may also aid in managing health and fertility of captive animals [2].
The Eastern black rhinoceros Diceros bicornis is critically endangered in the wild. The total number of individuals is estimated at 3,142, many of which live in captivity [3]. A captive breeding program aimed at boosting population size is currently ongoing [4]. Several aspects of this effort benefit from genomic resources, including genetics-informed choice of breeding animals, microbiome research in relation to health issues and the genetics of male fertility. Several genome assemblies are available for black rhino, but these are either from a female or produced using short reads, preventing the study of Y-chromosomal regions that may affect male fertility. We sequenced the genome of a male black rhino using ONT long reads with the aim of generating long contigs that can be assigned to the sex chromosomes.
We present here the results of the sequencing, assembly and annotation. A preliminary identification of sex-chromosomal regions in this genome assembly is outlined in [5].
Data description
A blood sample was collected from the Eastern black rhino ‘Vungu’ held at Blijdorp Zoo (Rotterdam, The Netherlands) during routine medical examination. DNA was extracted from whole blood using the Nanobind CBB kit (Circulomics) following the manufacturer’s instructions. ONT sequencing libraries were prepared using ligation sequencing kit v9 (SQK-LSK109) and v10 (SQK-LSK110) and sequenced on a Minion device with both R9 and R10 flowcells. FAST5 data was basecalled using the Guppy basecaller version 3.3.3. This resulted in 5.7 million reads with an average read length of 18 kb (Table 1, Data set 1, [6]). The sequence reads were assembled de novo using Flye with the parameters --nano-raw = ONT regular reads, pre-Guppy5, −i 2 = number of polishing iterations [7]. This resulted in a draft genome assembly of 2.47 Gb, consisting of 834 contigs with a contig N50 of 29.5 Mb. The draft genome assembly was annotated by lifting over the annotation of accession GCA_013634535.1 [8], which is the only Black rhino genome assembly for which an annotation is currently available. Over 99.9% of gene features were retrieved in the new genome assembly.
Table 1. Overview of data files/data setsLabelName of data file/data setFile types(file extension)Data repository and identifier (DOI or accession number)Data file 1Genome assembly of D. bicornisfasta file (.fna)https://www.ncbi.nlm.nih.gov/nuccore/JANTPW010000000 [9]Data file 2Gene annotationgff file (.gff)Figshare, 10.6084/m9.figshare.22699801 [10]Data set 1ONT sequence reads of D. bicornis genomic DNAfastq files (.fastq)NCBI Sequence Read Archive (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA777872) [6]
Limitations
The genome assembly is still fragmented and can be further improved using Hi-C, bionano or other techniques.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Witzenberger KA Hochkirch A Ex situ conservation genetics: a review of molecular studies on the genetic consequences of captive breeding programmes for endangered animal species Biodivers Conserv 20112018436110.1007/s 10531-011-0074-4 · doi ↗
- 2Supple MA Shapiro B Conservation of biodiversity in the genomics era Genome Biol 20181913110.1186/s 13059-018-1520-330205843 PMC 6131752 · doi ↗ · pubmed ↗
- 3IUCN Red List. https://www.iucnredlist.org/. Accessed 6 Jul 2023.
- 4Foose TJ Wiese RJ Population management of rhinoceros in captivity Int Zoo Yearbook 2006401749610.1111/j.1748-1090.2006.00174.x · doi ↗
- 5NCBI Sequence Read Archive. 2021. https://identifiers.org/ncbi/bioproject:PRJNA 777872.
- 6Kolmogorov M Yuan J Lin Y Pevzner P Assembly of long, error-prone reads using repeat graphs Nat Biotechnol 201937540610.1038/s 41587-019-0072-830936562 · doi ↗ · pubmed ↗
- 7Moodley Y Westbury MV Russo I-RM Gopalakrishnan S Rakotoarivelo A Olsen R-A Interspecific Gene Flow and the evolution of specialization in Black and White Rhinoceros Mol Biol Evol 20203731051710.1093/molbev/msaa 14832585004 · doi ↗ · pubmed ↗
- 8Genbank. https://www.ncbi.nlm.nih.gov/nuccore/JANTPW 010000000, 31 juli 2023.
