Draft genome sequence of benzo[a]pyrene degrading Bacillus altitudinis strain AR19 isolated from Digboi oil refinery (India)
Abhisek Dasgupta, Ratul Saikia, Bibhuti B. Kakoti, Pratap Jyoti Handique

TL;DR
Scientists sequenced the genome of a bacteria that can break down a harmful chemical found in oil, which could help in cleaning up oil pollution.
Contribution
The paper presents the first draft genome sequence of a benzo[a]pyrene-degrading Bacillus altitudinis strain from an oil refinery environment.
Findings
The genome contains genes for ring-cleaving dioxygenases, which may help degrade benzo[a]pyrene.
The genome includes genes related to biofilm formation, aiding survival in hydrocarbon-rich environments.
Abstract
Bacillus altitudinis AR19, isolated from the Digboi oil refinery (India), has a genome size of 3,630,000 bp with a G+C content of 42.45%. The genome encodes 3,755 protein-coding genes, including those for ring-cleaving dioxygenases and biofilm formation. These genes likely play crucial roles in the bacterium’s survival in hydrocarbon-enriched environments.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —Council of Scientific and Industrial Research, Indiahttp://dx.doi.org/10.13039/501100001412
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBacterial biofilms and quorum sensing · Genomics and Phylogenetic Studies · Microbial bioremediation and biosurfactants
ANNOUNCEMENT
Benzo[a]pyrene degrading Bacillus altitudinis strain AR19 was isolated from soil sample collected from Digboi oil refinery (27°23′27″N 95°37′07″E) situated in India. Sixteen polycyclic aromatic hydrocarbons (PAHs) were recognized by the US Environmental Protection Agency as significant environmental pollutants (1), out of which benzo[a]pyrene (B[a]P) is one of them. B[a]P, which belongs to high molecular weight PAHs, is also considered to be a potent carcinogen to humans, as recognized by the International Agency for Research on Cancer (2). B. altitudinis has been found to degrade aromatic compounds found in hydrocarbon pollutants (3). B. altitudinis species belongs to the rod shape, gram-positive bacteria classified under Firmicutes (4). It was first reported to be isolated from air samples collected from the stratosphere. It has diverse habitat (4). B. altitudinis strain AR19 was isolated using the serial dilution method (at 37°C) and screened against benzo[a]pyrene (Sigma-Aldrich) using spray plate technique in Bushnell Haas Agar (HIMEDIA) (5). AR19 was found to be positive against benzo[a]pyrene. Overnight culture of 48 hours of B. altitudinis strain AR19 was used to extract the genomic DNA. GeNei Bacterial DNA purification kit (Genei Laboratories Private Limited, Bengaluru, India) was used for DNA extraction, and isolation of the genomic DNA of B. altitudinis strain AR19 was done using the manufacturer’s protocol. The genomic DNA was quantified by BioSpectrometer (Eppendorf, Germany).
The genomic DNA of B. altitudinis strain AR19 was outsourced to AgriGenome Labs Pvt Ltd (India) for sequencing. The library was prepared using the Illumina DNA Prep kit (Illumina, California, USA) following the manufacturer’s protocol. Sequencing was done on the Illumina HiSeq 2500 platform. Paired-end reads were generated with read length of 250 bp. The total reads generated were 4,199,382. Quality check of the raw reads was done by the FastQC tool (6). The adapter sequence was removed by Cutadapt v1.8 (7). Fastp was used to join the trimmed reads (8). Kmergenie was used to check the kmers for de novo assembly, which was done in SPAdes v. 3.13.1 (9, 10). The genome assembly was evaluated using Quast (11). The genome assembly was done using k-mer size of 67, 69, 71, 73, 75, and 77. QUAST reported predicted that k-mer of size 73 was the best assembly with the largest contig size of 299,197 bp and N50 equal to 94,649 bp and the total number of contigs equal to 85. The assembly size was 3,630,000 bp. GTDB-Tk (Genome Taxonomy Database) tool was used for phylogenetic analysis (12). The AR19 genome also showed closest reference with B. altitudinis. The genome was annotated using the NCBI Prokaryotic Genome Annotation Pipeline (13). For all the software used in this study, default parameters were used except where otherwise noted.
The genome of B. altitudinis strain AR19 encodes 3,849 total number of genes, out of which 3,755 were protein-coding genes and 72 RNA genes. Notable proteins include catechol 2,3-dioxygenase, mechanosensitive channel and putative ring-cleaving dioxygenase (MhqO). These proteins may contribute to the strain’s resistance to benzo[a]pyrene (14–16).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Yuan SY, Wei SH, Chang BV. 2000. Biodegradation of polycyclic aromatic hydrocarbons by a mixed culture. Chemosphere 41:1463–1468. doi:10.1016/s 0045-6535(99)00522-611057584 · doi ↗ · pubmed ↗
- 2Hardonnière K, Saunier E, Lemarié A, Fernier M, Gallais I, Héliès-Toussaint C, Mograbi B, Antonio S, Bénit P, Rustin P, Janin M, Habarou F, Ottolenghi C, Lavault M-T, Benelli C, Sergent O, Huc L, Bortoli S, Lagadic-Gossmann D. 2016. The environmental carcinogen benzo[a]pyrene induces a warburg-like metabolic reprogramming dependent on NHE 1 and associated with cell survival. Sci Rep 6:30776. doi:10.1038/srep 3077627488617 PMC 4973274 · doi ↗ · pubmed ↗
- 3Aziz AA, Zulbahri AS, Sharif MF, Mohamad MAN, Gan HM, Azizan NH. 2022. Genome sequence dataset of Bacillus altitudinis strain ST 14 isolated from Tunggak River in Gebeng Industrial Park. Vol. 45. Data Brief, Kuantan, Pahang.10.1016/j.dib.2022.108718 PMC 982328236624876 · doi ↗ · pubmed ↗
- 4Vettath VK, Junqueira ACM, Uchida A, Purbojati RW, Houghton JNI, Chénard C, Drautz-Moses DI, Wong A, Kolundžija S, Clare ME, Lau KJX, Gaultier NE, Heinle CE, Premkrishnan BNV, Gusareva ES, Acerbi E, Yang L, Schuster SC. 2017. Complete genome sequence of Bacillus altitudinis type strain SG Air 0031 isolated from tropical air collected in Singapore. Genome Announc 5:e 01260-17. doi:10.1128/genome A.01260-1729122878 PMC 5679811 · doi ↗ · pubmed ↗
- 5Zhao B, Wang H, Mao X, Li R. 2009. A rapid screening method for bacteria degrading polycyclic aromatic hydrocarbons. Lett Appl Microbiol 49:408–410. doi:10.1111/j.1472-765X.2009.02668.x 19627476 · doi ↗ · pubmed ↗
- 6Leggett RM, Ramirez-Gonzalez RH, Clavijo BJ, Waite D, Davey RP. 2013. Sequencing quality assessment tools to enable data-driven informatics for high throughput genomics. Front Genet 4:288. doi:10.3389/fgene.2013.0028824381581 PMC 3865868 · doi ↗ · pubmed ↗
- 7Criscuolo A, Brisse S. 2014. Alien Trimmer removes adapter oligonucleotides with high sensitivity in short-insert paired-end reads commentary on turner (2014) assessment of insert sizes and adapter content in FASTQ data from Nextera XT libraries. Front Genet 5:130. doi:10.3389/fgene.2014.0013024860597 PMC 4026695 · doi ↗ · pubmed ↗
- 8Chen S, Zhou Y, Chen Y, Gu J. 2018. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i 884–i 890. doi:10.1093/bioinformatics/bty 56030423086 PMC 6129281 · doi ↗ · pubmed ↗
