Draft genome sequence of the epiphytic Bacillus pumilus strain App11D isolated from the surface of Alhagi pseudoalhagi (Bieb.) Fisch. seeds
Vladimir K. Chebotar, Ivan A. Lunegov, Maria S. Gancheva, Dmitriy V. Kudryavtcev, Olga A. Bortsova, Elena P. Chizhevskaya, Veronika N. Pishchik

TL;DR
This paper presents the genome sequence of a bacteria found on the seeds of a specific plant.
Contribution
The study provides a new genome sequence for Bacillus pumilus strain App11D.
Findings
The genome of Bacillus pumilus App11D was sequenced using Illumina technology.
The genome was assembled de novo using SPAdes software.
Abstract
The genome sequence of the epiphytic bacterium Bacillus pumilus App11D, isolated from the seeds of Alhagi pseudoalhagi, has been obtained. The sequencing was performed using Illumina technology, and the genome was assembled de novo with SPAdes software.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Plant-Microbe Interactions and Immunity · Probiotics and Fermented Foods
ANNOUNCEMENT
The epiphytic strain App11D was isolated from the seed surface of the drought-tolerant plant Alhagi pseudoalhagi (Bieb.) Fisch., collected in the Neftekumsky district of Stavropol Territory (near the village of Achikulak, N 44°33′43″, E 44°49′59″), following the protocol described in (1). 10 g of seeds were sonicated (Bandelin, 50 Hz, 10 min) in 100 mL sterile water. Serial dilutions (0.1 mL) were plated on LB and Ashby media, followed by 3–5 days of incubation at 28°C. Genomic DNA extraction was carried out using the Wizard Genomic DNA Purification kit (Promega, USA) according to the manufacturer’s instructions. Library preparation was performed with 100 ng of extracted DNA using the KAPA HyperPlus kit (Roche, Switzerland). Final purification was conducted with KAPA Hyperpure beads (Roche, Switzerland). The library size distribution and quality assessment were evaluated with the High Sensitivity DNA kit (Agilent Technologies, USA), while the Quant-iT DNA Assay kit (Thermo Fisher Scientific, USA) was used for quantification. Sequencing was performed on the NextSeq 1000 platform (Illumina, USA) using NextSeq 1000/2000 P1 Reagents (300 cycles) with 2% PhiX spike-in control, generating 1,139,301 read pairs (150 bp length).
The quality of the raw reads was assessed using FastQC v0.11.9 (2), which indicated high-quality data with no need for additional trimming or filtering. Genome assembly was performed using SPAdes v3.14.1 (3), yielding 59 contigs with an N50 of 1,836,326 bp. All Bacillus reference genomes were downloaded from NCBI RefSeq (4), and FastANI v1.34 (5) was used to perform whole-genome comparisons between these genomes and our assembly. The App11D strain showed the highest similarity to Bacillus pumilus SAFR-032 (GCF_000017885.4) with an average nucleotide identity of 95.5%. Scaffolding was performed by aligning contigs to the reference genome with Ragtag v2.1.0 (6), followed by the removal of scaffolds shorter than 500 bp. Plasmid detection was carried out on the resulting scaffolds using PlasmidFinder-2.0 (7) and PlasFlow-1.1 (8), but no plasmids were identified. Assembly quality metrics were evaluated with QUAST v5.1.0 (9), the samtools coverage tool v1.21 (10), and BUSCO v5.2.2 (bacillales_odb10) (11). Genome coverage was calculated by mapping reads to the assembly with minimap2 v2.28 (12). The draft genome assembly achieved 94.3× coverage with only 5 scaffolds totaling 3,616,178 bp (N50 value = 3,610,638 bp). The GC content was calculated at 42.05%, and BUSCO analysis confirmed 100% completeness of the assembly. Unless otherwise specified, default software parameters were applied.
The genome annotation was carried out with the NCBI Prokaryotic Genome Annotation Pipeline v6.9 (13), resulting in the identification of 3,705 genes. Gene clusters associated with secondary metabolite biosynthesis were analyzed using antiSMASH v7.1.0 (14) with all additional features enabled. The analysis identified gene clusters predicted to encode biosynthetic pathways for lichenysin, zwittermicin A, schizokinen, fengycin, bacilysin, and bacillibactin production. Previous studies have demonstrated the potential of B. pumilus strains to promote plant growth by producing phytohormones and other bioactive compounds that improve plant biometric traits and stress tolerance (15). Given its isolation from the drought-resistant A. pseudoalhagi, strain App11D, represents a promising candidate for investigating plant growth-promotion and stress-alleviation properties.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Pishchik VN, Chizhevskaya EP, Chebotar VK, Mirskaya GV, Khomyakov YV, Vertebny VE, Kononchuk PY, Kudryavtcev DV, Bortsova OA, Lapenko NG, Tikhonovich IA. 2024. PGPB isolated from drought-tolerant plants help wheat plants to overcome osmotic stress. Plants (Basel) 13:3381. doi:10.3390/plants 1323338139683174 PMC 11644311 · doi ↗ · pubmed ↗
- 2Andrews S. 2010. Fast QC: a quality control tool for high throughput sequence data. Available from: http://www.bioinformatics.babraham.ac.uk/projects/fastqc
- 3Prjibelski A, Antipov D, Meleshko D, Lapidus A, Korobeynikov A. 2020. Using SP Ades de novo assembler. Curr Protoc Bioinformatics 70:e 102. doi:10.1002/cpbi.10232559359 · doi ↗ · pubmed ↗
- 4O’Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, Mc Veigh R, Rajput B, Robbertse B, Smith-White B, Ako-Adjei D, et al.. 2016. Reference sequence (Ref Seq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res 44:D 733–45. doi:10.1093/nar/gkv 118926553804 PMC 4702849 · doi ↗ · pubmed ↗
- 5Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. 2018. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun 9:5114. doi:10.1038/s 41467-018-07641-930504855 PMC 6269478 · doi ↗ · pubmed ↗
- 6Alonge M, Soyk S, Ramakrishnan S, Wang X, Goodwin S, Sedlazeck FJ, Lippman ZB, Schatz MC. 2019. Ra GOO: fast and accurate reference-guided scaffolding of draft genomes. Genome Biol 20:224. doi:10.1186/s 13059-019-1829-631661016 PMC 6816165 · doi ↗ · pubmed ↗
- 7Carattoli A, Zankari E, García-Fernández A, Voldby Larsen M, Lund O, Villa L, Møller Aarestrup F, Hasman H. 2014. In silico detection and typing of plasmids using Plasmid Finder and plasmid multilocus sequence typing. Antimicrob Agents Chemother 58:3895–3903. doi:10.1128/AAC.02412-1424777092 PMC 4068535 · doi ↗ · pubmed ↗
- 8Krawczyk PS, Lipinski L, Dziembowski A. 2018. Plas Flow: predicting plasmid sequences in metagenomic data using genome signatures. Nucleic Acids Res 46:e 35. doi:10.1093/nar/gkx 132129346586 PMC 5887522 · doi ↗ · pubmed ↗
