Draft genome sequence of cellulose-degrading Bacillus stercoris BHUJPV-SS7 isolated from soil mixed with wood powder
Saurabh Singh, Arthur Prudêncio de Araujo Pereira, Thierry A. Pellegrinetti, Jay Prakash Verma

TL;DR
This paper presents the genome sequence of a soil-isolated bacteria that can break down cellulose.
Contribution
The complete genome of a cellulose-degrading Bacillus stercoris strain is reported for the first time.
Findings
The genome contains 4,299 predicted genes and 4,012 protein-coding genes.
A beta-1,4-glucanase gene, important for cellulose degradation, is identified.
Abstract
We report a complete genome of Bacillus stercoris BHUJPV-SS7 isolated from soil which contains 4,299 predicted genes and 4,012 predicted protein-coding genes within its chromosome (4,115,399 bp), and has 43.51% G + C content and a predicted beta-1,4-glucanase (EC 3.2.1.4) gene.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —University Grants Commission (UGC)
- —Banaras Hindu University (BHU)
- —Department of Science and Technology, Ministry of Science and Technology, India (DST)
- —Banaras Hindu University (BHU)
- —DST | Science and Engineering Research Board (SERB)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Bacteriophages and microbial interactions · Bacterial Genetics and Biotechnology
ANNOUNCEMENT
Bacillus spp. commonly found in the rhizosphere soils (1) are known to harbor many beneficial properties including the ability of cellulose degradation (2). The bacteria described could be a potent cellulose degrader and hence whole-genome sequencing of Bacillus stercoris BHUJPV-SS7 was done.
Bacillus stercoris BHUJPV-SS7 was isolated from rhizosphere soil [collection depth 10 cm from surface (25.267270 N, 82.987285 E)], near a mango tree, recently fallen, and trunk degraded from inside leading to wood powder mixing with soil. The soil and pruned wood powder, collected simultaneously, were homogenized using a handheld homogenizer, serially diluted in 0.85% NaCl water, and spread after dilutions (from 10^−2^ to 10^−7^) on CMC (carboxymethyl cellulose) agar plates (0.5 g KH_2_PO_4_, 0.25 g MgSO_4_, 2 g CMC, 15 g agar, 0.2 g Congo-Red, and 2 g gelatin; 1,000 mL water at pH 6.8–7.2) and incubated for 5 days at 30°C. Subsequent subculturing (three times) in the same media produced multiple colonies with good zones (>15 mm clearance) of cellulose degradation and were selected for further assessment. Single colonies were cultured in nutrient agar (peptone 5 g/L, beef extract 3 g/L, agar 20 g/L, NaCI 0.5 g/L) media and incubated in a biological oxygen demand incubator for 72 h at 30°C. Enzyme quantification was performed on selected strains to select BHUJPV-SS7 for whole-genome sequencing.
Genomic DNA isolated using QIAGEN kit (cat. no: 51304) was checked using a Qubit 3 Fluorometer with dS DNA HS Dye (Thermo Fisher Scientific, Waltham, MA, USA). DNA samples were purified using AMPure beads, and enriched through PCR amplification (six cycles), using NEBNext Ultra II Q5 Master Mix (New England Biolabs), Illumina universal primer, and sample-specific octamer primers. Fragment analysis was done on Agilent 2100 Bioanalyzer with 1 µL library, sequenced on Illumina HiSeq 4000 (Illumina, Inc., San Diego, CA, USA).
The raw sequences were quality controlled using Fastqc v.0.11.9 (3) and Multiqc v.1.10.1 (4) and the sequence adapters were removed using Trimgalore v.0.6.6 (5). Default parameters were employed for all softwares in this study. The trimmed reads were assembled de novo using the Unicycler v.0.4.8 assembler (6). The final assembly was functionally annotated using RASTtk v.1.073 (7) and by NCBI Prokaryotic Genome Pipeline v.6.6 (8). Genome quality and taxonomic validation, including average nucleotide identity (ANI), were conducted using CheckM v.1.2.2 (9) and fastANI v.1.33 (10). The presence of beta-1,4-glucanase gene was predicted using dbCAN2 HMMs of CAZy v.1.9.1 families—v.10 in the KBase platform (11–13).
17,727,601 paired-reads of 151 bp were generated in this project. The draft genome measured 4,117,388 bp with 43.51% of average G + C content and median coverage of 1,110×. Genome assembly revealed 29 contigs with N50 value of 912.3 kilobases. The taxonomic annotation of the draft genome of strain BHUJPV-SS7 was most closely affiliated to Bacillus stercoris (ASM2055188v1) with 98.76% identical by ANI results. A total of 62 genes linked to carbohydrate degradation were predicted, including glycoside hydrolases, one gene for polyphenolic degradation, one for lignin degradation, and 11 lignin degradation auxiliary genes indicating potential for cellulose degradation and biomass biovalorization.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Meena VS, Maurya B, Meena SK, Meena RK, Kumar A, Verma J, Singh N. 2016. Can Bacillus species enhance nutrient availability in agricultural soils?, p 367–395. In Bacilli and Agrobiotechnology. doi:10.1007/978-3-319-44409-3_16. · doi ↗
- 2Li Y-H, Ding M, Wang J, Xu G-J, Zhao F. 2006. A novel thermoacidophilic endoglucanase, Ba-EGA, from a new cellulose-degrading bacterium, Bacillus sp. AC-1. Appl Microbiol Biotechnol 70:430–436. doi:10.1007/s 00253-005-0075-x 16142468 · doi ↗ · pubmed ↗
- 3Andrews SF, Krueger F, Seconds-Pichon A, Biggins F, Wingett SF. 2014. A quality control tool for high throughput sequence data. Babraham Bioinformatics
- 4Ewels P, Magnusson M, Lundin S, Käller M. 2016. Multi QC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32:3047–3048. doi:10.1093/bioinformatics/btw 35427312411 PMC 5039924 · doi ↗ · pubmed ↗
- 5Krueger F. 2012. Trim Galore: a wrapper tool around Cutadapt and Fast QC to consistently apply quality and adapter trimming to Fast Q files, with some extra functionality for Msp I-digested RRBS-type (Reduced Representation Bisufite-Seq) libraries. Available from: http://www. bioinformatics.babraham.ac.uk/projects/trim_galore/. Retrieved 3 Sep 2023.
- 6Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. P Lo S Comput Biol 13:e 1005595. doi:10.1371/journal.pcbi.100559528594827 PMC 5481147 · doi ↗ · pubmed ↗
- 7Brettin T, Davis JJ, Disz T, Edwards RA, Gerdes S, Olsen GJ, Olson R, Overbeek R, Parrello B, Pusch GD, Shukla M, Thomason JA, Stevens R, Vonstein V, Wattam AR, Xia F. 2015. RAS Ttk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep 5:8365. doi:10.1038/srep 0836525666585 PMC 4322359 · doi ↗ · pubmed ↗
- 8Tatusova T, Di Cuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi:10.1093/nar/gkw 56927342282 PMC 5001611 · doi ↗ · pubmed ↗
