Whole-genome sequences of six Borrelia recurrentis strains obtained via PacBio sequencing
Alhussien M. Gaber, John C. Blazier, Artem S. Rogovskyy

TL;DR
This paper presents the whole-genome sequences of six Borrelia recurrentis strains linked to louse-borne relapsing fever.
Contribution
The study provides new whole-genome sequences of six Borrelia recurrentis strains using PacBio sequencing.
Findings
Each genome includes one linear chromosome and five linear plasmids.
The average genome size is 1,284,895 bp with a GC content of 27.5%.
Abstract
Provided are whole-genome sequences of six Borrelia recurrentis strains that had been earlier isolated from louse-borne relapsing fever patients. The sequences of each genome presented here included one linear chromosome and 5 linear plasmids, whose average size was 1,284,895 bp with the mean GC content being 27.5%.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
| A1 | A11 | A17 | PAbJ | PBeK | PAbN | |
|---|---|---|---|---|---|---|
| Contig assembly statistics | ||||||
| Total bases | 4,652,507,565 | 3,108,737,453 | 5,393,071,144 | 4,434,757,658 | 4,936,239,489 | 4,851,771,615 |
| Number of reads | 370,257 | 254,752 | 421,692 | 363,105 | 393,574 | 393,519 |
| N50 read length | 12,594 | 12,242 | 12,809 | 12,257 | 12,567 | 12,338 |
| N50 contig length | 932,334 | 932,334 | 932,354 | 932,347 | 932,332 | 932,342 |
| Average coverage (X) | 3,639 | 2,410 | 4,199 | 3,440 | 3,844 | 3,778 |
| Total DNA size | 1,278,199 | 1,289,858 | 1,284,120 | 1,288,942 | 1,284,107 | 1,284,144 |
| Genome features | ||||||
| Chromosome | 932,334 | 932,334 | 932,354 | 932,347 | 932,332 | 932,342 |
| lp190 | 190,003 | 190,002 | 190,010 | 189,996 | 189,995 | 189,998 |
| lp55 | 55,036 | 55,033 | 55,035 | 55,042 | 55,042 | 55,045 |
| lp42 | 42,100 | 42,085 | 42,100 | 42,120 | 42,116 | 42,121 |
| lp35 | 35,472 | 35,483 | 35,480 | 35,475 | 35,475 | 35,483 |
| lp23 | 23,254 | 34,921 | 29,141 | 33,962 | 29,147 | 29,155 |
| Total genes | 1,260 | 1,275 | 1,268 | 1,272 | 1,268 | 1,265 |
| Total GC content (%) | 27.4 | 27.5 | 27.5 | 27.5 | 27.5 | 27.5 |
| SRA accession numbers |
|
|
|
|
|
|
| Gene bank accession numbers |
|
|
|
|
|
|
- —HHS | NIH | National Institute of Allergy and Infectious Diseases (NIAID)
- —HHS | NIH | National Institute of Allergy and Infectious Diseases (NIAID)
- —HHS | NIH | National Institute of Allergy and Infectious Diseases (NIAID)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVector-borne infectious diseases · Viral Infections and Vectors · Mosquito-borne diseases and control
ANNOUNCEMENT
Louse-borne relapsing fever (LBRF) is a vector-borne disease whose morbidity and mortality are significant in some African countries (1–5). LBRF is caused by Borrelia recurrentis, the spirochetal bacterium that affects only humans with no known animal reservoir (5, 6). The genome of B. recurrentis strain A1 previously obtained by shotgun approach was shown to comprise a 0.9 Mb chromosome and 7 linear plasmids (lp124, lp53, lp37, lp35, lp33, lp23, lp6) (7). A more recent study that utilized Illumina MiSeq technology to sequence six additional B. recurrentis strains (A17, PBeK, PAbJ, PAbN, PMaC, PUfA) demonstrated almost identical genome compositions (8). The exception was lp124, whose size was consistently larger for the six strains (8), supporting previously generated pulsed-field gel electrophoresis data (9–12). This discrepancy, and an inherent failure of both sequencing methods to accurately define the B. recurrentis antigenic variation system because of its highly homologous elements (13), prompted us to re-sequence strains A1, A11, A17, PBeK, PAbJ, and PAbN by using the long-read PacBio technology.
The origins of strains are as follow. The A strains were isolated from the blood of LBRF patients originated from Ethiopia in the 1990s (9, 10). PBeK and PAbN were isolated in Germany in 2004 and 2015 from one LBRF patient returning and the other migrating from Ethiopia, respectively (8). PAbJ was isolated in Germany in 2015 from a LBRF patient originated from Somali (8).
B. recurrentis was propagated to ~5 × 10^7^–1 × 10^8^ cells/mL in home-made Barbour-Stoenner-Kelly II medium supplemented with 10% (vol/vol) rabbit serum (Sigma-Aldrich, MO, USA) and 1.4% (wt/vol) gelatin (Sigma-Aldrich, MO, USA) at 35°C under 2.5% CO_2_ (14), and pellets were shipped to Maryland Genomics, Institute for Genome Sciences (IGS), University of Maryland School of Medicine (MD, Baltimore, USA) for DNA extraction and sequencing. DNA samples were extracted using the Qiagen MagAtract HMW DNA kit (Qiagen, MI, USA) and sheared to an average size of 10–15 kb. Libraries prepared using the SMRTbell Express Template Prep Kit 2.0 (Pacific Biosciences, CA, USA) were size-selected on the BluePippin instrument (Sage Science, MA, USA) to remove fragments of <10 kb, which excluded the smallest previously sequenced plasmid, lp6 (7, 8). The library pool was sequenced with Sequel II Sequencing Kit 2.0 and 8M SMRT cell on the Sequel II instrument (Pacific Bioscience, CA, USA).
De-novo assemblies were performed using IPA (SMRTtools version 11.0.0) at IGS. On the Grace computing cluster at Texas A&M University, PacBio reads were downsampled to 25K sequences per sample using seqtk (version 1.3) due to the extremely high coverage of the data set (Table 1). The downsampled data were then assembled in Flye (version 2.9.1) under default parameters. Contigs were annotated using the NCBI Prokaryotic Genome Annotation Pipeline (15).
Each assembled B. recurrentis genome contained one chromosome and 5 linear plasmids (Table 1). To determine the completeness of all chromosomes and plasmids, we identified B. recurrentis-specific telomeric box 3 sequences (5′-TTAGTATA-3′ and 5′-TATACTAA-3′), which are conserved inverted terminal repeats recognized by telomere resolvase during borrelial replication (16–19). As a result, previously sequenced lp124 and lp37 of strain A1 (7) were found to constitute a single plasmid, lp190. Overall, the present data demonstrated that the B. recurrentis chromosomes and four largest plasmids, lp190, lp55, lp42, and lp35, had similar lengths in all the strains, whereas lp23 size varied considerably between the six strains.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Yimer M, Mulu W, Ayalew W, Abera B. 2014. Louse-borne relapsing fever profile at Felegehiwot referral hospital, Bahir Dar city, Ethiopia: a retrospective study. BMC Res Notes 7:250. doi:10.1186/1756-0500-7-25024742342 PMC 4065083 · doi ↗ · pubmed ↗
- 2Borgnolo G, Hailu B, Ciancarelli A, Almaviva M, Woldemariam T. 1993. Louse-borne relapsing fever. A clinical and an epidemiological study of 389 patients in Asella Hospital, Ethiopia. Trop Geogr Med 45:66–69.8511813 · pubmed ↗
- 3Salih SY, Mustafa D, Abdel Wahab SM, Ahmed MA, Omer A. 1977. Louse-borne relapsing fever: I. A clinical and laboratory study of 363 cases in the Sudan. Trans R Soc Trop Med Hyg 71:43–48. doi:10.1016/0035-9203(77)90206-1871032 · doi ↗ · pubmed ↗
- 4de Jong J, Wilkinson RJ, Schaeffers P, Sondorp HE, Davidson RN. 1995. Louse-borne relapsing fever in southern Sudan. Trans R Soc Trop Med Hyg 89:621. doi:10.1016/0035-9203(95)90414-x 8594674 · doi ↗ · pubmed ↗
- 5Cutler SJ. 2006. Possibilities for relapsing fever reemergence. Emerg Infect Dis 12:369–374. doi:10.3201/eid 1203.05089916704771 PMC 3291445 · doi ↗ · pubmed ↗
- 6Warrell DA. 2019. Louse-borne relapsing fever (Borrelia recurrentis infection). Epidemiol Infect 147:e 106. doi:10.1017/S 095026881900011630869050 PMC 6518520 · doi ↗ · pubmed ↗
- 7Lescot M, Audic S, Robert C, Nguyen TT, Blanc G, Cutler SJ, Wincker P, Couloux A, Claverie JM, Raoult D, Drancourt M. 2008. The genome of Borrelia recurrentis, the agent of deadly louse-borne relapsing fever, is a degraded subset of tick-borne Borrelia duttonii. P Lo S Genet 4:e 1000185. doi:10.1371/journal.pgen.100018518787695 PMC 2525819 · doi ↗ · pubmed ↗
- 8Marosevic D, Margos G, Wallich R, Wieser A, Sing A, Fingerle V. 2017. First insights in the variability of Borrelia recurrentis genomes. P Lo S Negl Trop Dis 11:e 0005865. doi:10.1371/journal.pntd.000586528902847 PMC 5612729 · doi ↗ · pubmed ↗
