Draft genome sequence of Bacillus sp. strain X and Salarachaeum sp. strain III isolated from Lake Karum, Danakil Depression, Ethiopia
Ermias Sissay Balcha, Karen Olsson-Francis, Ben Stephens, Barbara Cavalazzi, Adugna Abdi Woldesemayat, Mesfin Tafesse Gemeda, Michael C. Macey

TL;DR
This paper presents draft genome sequences of two microbial strains, one bacterial (Bacillus) and one archaeal (Salarachaeum), isolated from a hypersaline lake in Ethiopia. The sequences could support studies of the strains' biological activities.
Contribution
The novel contribution is the first set of draft genome sequences for Bacillus and Salarachaeum strains from Lake Karum.
Findings
Draft genomes of two microbial strains from Lake Karum, a Bacillus and a Salarachaeum, were successfully sequenced.
The genomes may help clarify the organisms' potential for secondary metabolite production.
Abstract
Here, we report the draft genome sequences of strains of Bacillus and Salarachaeum that were isolated from hypersaline water samples collected from Lake Karum, Danakil Depression, Ethiopia. The sequences pave the way for more targeted studies into the potential biological activities and secondary metabolite synthesis of these organisms.
Taxonomy
Topics: Genomics and Phylogenetic Studies · Microbial Community Ecology and Physiology · Bacteriophages and microbial interactions
ANNOUNCEMENT
Lake Karum, also known as Lake Assale, is located in the Danakil Depression, northern Ethiopia. It is a brine lake (28%–33% salinity) that lies 120 m below sea level, with temperatures ranging between 34°C and 36°C (1, 2). Previous studies have shown that Lake Karum supports a diverse microbial community (3, 4). Here, we present the draft genomes of Bacillus sp. strain X and Salarachaeum sp. strain III. The strains were isolated from water samples collected in Falcon tubes from a depth of 5 cm in Lake Karum (14.0207° N, 40.4007° E).
The water samples were enriched in Soya Flour Medium (SFM) containing 4 M NaCl. Serial dilutions were prepared, plated onto soy flour agar, and incubated at 30°C for 30 days. Distinct colonies were observed, leading to the isolation of Bacillus sp. strain X and Salarachaeum sp. strain III (5). Pure colonies were cultured in SFM, and DNA was extracted from these cultures using the Griffiths technique (6). DNA libraries were prepared by MicrobesNG using the Nextera XT Library Prep Kit. Paired-end (2 × 250 bp) genomic sequencing was carried out by MicrobesNG (Birmingham, UK) using Illumina HiSeq technology. Raw reads were trimmed using Trimmomatic (v0.30) with a quality cutoff of Q15 (7), and de novo assembly was carried out with SPAdes (v3.7) (8). Genome quality was assessed with CheckM (9). Genome coverage was calculated using BWA, SAMtools (0.1.19), and BEDTools genomecov (2.2.7) (10–12). Genome annotation was performed using the Rapid Annotations using Subsystems Technology (RAST) annotation server (v2.0) with the classic RAST pipeline (13). The presence of metabolic pathways was further screened using BlastKOALA (2.3) (14).
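The coverage step can be illustrated with a minimal sketch. It assumes per-base depth records in the three-column, tab-separated layout (contig, position, depth) that `bedtools genomecov -d` writes. The function name `mean_fold_coverage` and the toy records are hypothetical and are not taken from the authors' pipeline.

```python
from typing import Iterable


def mean_fold_coverage(depth_lines: Iterable[str]) -> float:
    """Return the mean per-base depth from tab-separated depth records.

    Each record is assumed to hold: contig name, 1-based position, depth.
    """
    total_depth = 0
    n_positions = 0
    for line in depth_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        _contig, _position, depth = line.split("\t")
        total_depth += int(depth)
        n_positions += 1
    if n_positions == 0:
        raise ValueError("no depth records found")
    return total_depth / n_positions


# Toy records (hypothetical values, not data from the paper)
example = ["contig_1\t1\t150", "contig_1\t2\t160", "contig_2\t1\t170"]
print(mean_fold_coverage(example))  # 160.0
```

Averaging over every position, including those with zero depth, gives the genome-wide fold coverage. Whether the authors averaged this way or used another summary is not stated in the announcement.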
Bacillus sp. strain X was most closely related to Bacillus subtilis, with an average nucleotide identity of 98.26% as determined by ANItools (15). The draft genome of Bacillus sp. strain X comprised 17 contigs with 2,322 coding sequences (CDSs), a GC content of 44.92%, and a genome size of 3,391,284 bp, with 160-fold coverage. Salarachaeum sp. strain III was closely related to Halobacteriales, with 91.36% nucleotide sequence identity. Its genome comprised 129 contigs with 943 CDSs and a GC content of 67.50%. The genome size was 2,034,915 bp, with 30-fold coverage. CheckM analysis identified completeness and contamination values of 98.73% and 0.32% for the Bacillus genome and 99.28% and 1.52% for the Salarachaeum genome.
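Summary statistics such as contig count, genome size, and GC content can be recomputed from an assembly's contigs file. The following is a minimal sketch, assuming plain FASTA input supplied as lines of text. The function name `assembly_stats` and the toy sequences are hypothetical, and GC content is computed here over all bases, including any ambiguous ones, which may differ from the convention used by the tools cited above.

```python
from typing import Dict, Iterable


def assembly_stats(fasta_lines: Iterable[str]) -> Dict[str, float]:
    """Count contigs, total length, and GC percentage in FASTA-formatted lines."""
    n_contigs = 0
    total_bp = 0
    gc_bp = 0
    for line in fasta_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            n_contigs += 1  # each header starts a new contig
            continue
        seq = line.upper()
        total_bp += len(seq)
        gc_bp += seq.count("G") + seq.count("C")
    gc_percent = 100.0 * gc_bp / total_bp if total_bp else 0.0
    return {"contigs": n_contigs, "size_bp": total_bp, "gc_percent": gc_percent}


# Toy assembly (hypothetical sequences, not data from the paper)
toy = [">contig_1", "ATGCGC", "GGAT", ">contig_2", "ATATGC"]
print(assembly_stats(toy))
# {'contigs': 2, 'size_bp': 16, 'gc_percent': 50.0}
```

To apply this to a real file, the lines could be passed directly from an open file handle, for example `assembly_stats(open("contigs.fasta"))`, where the file name is a placeholder.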
Genome mining of Bacillus sp. strain X was carried out with the antibiotics and Secondary Metabolite Analysis Shell (antiSMASH, v6.0) program (16). It showed the presence of 13 (44.8%) non-ribosomal peptide gene clusters, which are predicted to yield enzymes, putative antimicrobials, and other bioactive compounds of significant importance in the biotechnological and medical sectors (17). Gene clusters encoding ribosomally synthesized and post-translationally modified peptides and terpenes were also identified in Salarachaeum sp. strain III. Regarding biogeochemical cycling, genes for the complete assimilatory sulfate and dissimilatory nitrate pathways were identified in the genome of Bacillus sp. strain X, while the genome of Salarachaeum sp. strain III contained putative NAD-dependent formate dehydrogenase and carbon monoxide dehydrogenase-encoding genes.
REFERENCES
1. Abbate E, Bruni P, Sagri M. 2015. Geology of Ethiopia: a review and geomorphological perspectives, p 33–64. In World geomorphological landscapes. Springer.
2. Cavalazzi B, Barbieri R, Gómez F, Capaccioni B, Olsson-Francis K, Pondrelli M, Rossi AP, Hickman-Lewis K, Agangi A, Gasparotto G, Glamoclija M, Ori GG, Rodriguez N, Hagos M. 2019. The Dallol geothermal area, Northern Afar (Ethiopia)-an exceptional planetary field analog on earth. Astrobiology 19:553–578. doi:10.1089/ast.2018.1926. PMID 30653331. PMC6459281.
3. Belilla J, Moreira D, Jardillier L, Reboul G, Benzerara K, López-García JM, Bertolino P, López-Archilla AI, López-García P. 2019. Hyperdiverse archaea near life limits at the polyextreme geothermal Dallol area. Nat Ecol Evol 3:1552–1561. doi:10.1038/s41559-019-1005-0. PMID 31666740. PMC6837875.
4. Gómez F, Cavalazzi B, Rodríguez N, Amils R, Ori GG, Olsson-Francis K, Escudero C, Martínez JM, Miruts H. 2019. Ultra-small microorganisms in the polyextreme conditions of the Dallol volcano, Northern Afar, Ethiopia. Sci Rep 9:7907. doi:10.1038/s41598-019-44440-8. PMID 31133675. PMC6536532.
5. Ravimannan N, Pathmanathan S. 2016. Soy flour as alternative culture media for yeasts. Available from: https://www.researchgate.net/publication/315836118
6. Griffiths RI, Whiteley AS, O’Donnell AG, Bailey MJ. 2000. Rapid method for coextraction of DNA and RNA from natural environments for analysis of ribosomal DNA- and rRNA-based microbial community composition. Appl Environ Microbiol 66:5488–5491. doi:10.1128/AEM.66.12.5488-5491.2000. PMID 11097934. PMC92488.
7. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi:10.1093/bioinformatics/btu170. PMID 24695404. PMC4103590.
8. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi:10.1089/cmb.2012.0021. PMID 22506599. PMC3342519.
