Complete genome sequence of strain M4-SHS-6, a potential new member of the genus Acidithiobacillus
Minjie Zhou, Tiantian Yu, Xiaomin Lan, Sidong Zhu, Jigang Chen

TL;DR
This paper presents the full genome sequence of a new sulfur-oxidizing bacterium, M4-SHS-6, which could belong to the Acidithiobacillus genus.
Contribution
The study provides a complete genome sequence of a potential new Acidithiobacillus strain using hybrid sequencing technology.
Findings
The genome includes one circular chromosome and two circular plasmids.
Hybrid sequencing (Illumina and Nanopore) was used for genome assembly.
Abstract
Here, we report the complete genome sequence of strain M4-SHS-6, a potential chemolithoautotrophic sulfur-oxidizing bacterium. The genome comprises one circular chromosome and two circular plasmids, assembled through hybrid sequencing (Illumina and Nanopore).
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —The First-Class Discipline of Biological Engineering of Zhejiang Province
- —The Zhejiang Provincial Basic Public Welfare Research Program
- —The Ningbo Public Welfare Technology Researh Program
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetal Extraction and Bioleaching · Corrosion Behavior and Inhibition · Genomics and Phylogenetic Studies
ANNOUNCEMENT
Strain M4-SHS-6 was isolated from a surface water sample collected in April 2018 from an artificial lake at Zhejiang Wanli University, China (GPS: 121.6°E, 29.8°N). A 100 µL aliquot of the sample was enriched in 10 mL of liquid DSMZ medium 68 (pH 6.5) containing Na_2_S_2_O_3_ as the sole energy source. The enrichment culture exhibiting a significant pH decrease was transferred to DSMZ medium 68-agar via dilution plating. A colony (designated M4-SHS-6) showing a yellow halo zone on DSMZ medium 68-agar plates after 7 days of incubation at 30°C was purified through repeated streaking.
Colony PCR targeting the nearly full-length 16S rRNA gene was performed using universal bacterial primers 27F and 1492R (1). The 16S rRNA sequence was manually validated and deposited in GenBank (accession no. PQ637443). EzTaxon-e analysis (2) indicated that strain M4-SHS-6 shares 94.2%–98.4% sequence similarity with type strains of the genus Acidithiobacillus, with the closest relative being Acidithiobacillus ferridurans ATCC 33020^T^ (98.4% identity; GenBank accession no. AJ278719).
Genomic DNA extracted from exponential phase cells (CTAB method(3)) was used for Illumina and Oxford Nanopore sequencing. The Illumina library was prepared using the NEXTflex Rapid DNA-Seq Kit (Bioo Scientific, USA) and sequenced on the Illumina HiSeq X Ten platform (Illumina Inc., USA). Approximately 1.46 Gb of raw data of 150-bp-long paired-end reads was generated. A total of 4,837,194 reads were subjected to control and trimming using SOAPnuke version 1.3.0 (4), generating a total of 4,811,505 clean reads. For Nanopore sequencing, DNA was sheared using g-TUBE, and large DNA fragments (>20 Kb) were selected using BluPippin (Sage Science, Beverly, MA, USA). The Nanopore library was prepared using the SQK-LSK109 kit (Oxford Nanopore Technologies, UK) and loaded into an R9.4.1 flow cell for the PromethION platform (Oxford Nanopore Technologies, UK). The base calling was performed using Guppy version 4.0.15 (Oxford Nanopore Technologies). Read QC was achieved using NanoPlot version 1.15.0 with a threshold value (Q) of >7 (5). Nanopore sequencing yielded 251,745 clean reads, with an average length of 5,660 bp and an N50 value of 8,791 bp.
Hybrid assembly of both read types was performed using Unicycler version 0.4.8 (6) and resulted in a single circular chromosome (3,003,844 bp) and two circular plasmids, pLA (8,964 bp) and pLB (3,693 bp), with an overall G + C content of 56.1%. We stated that pLA and pLB are plasmids verified by Plasflow (7) and PLSDB database (8). Genome coverage reached 99.68%, with average depths of 482.6× (Illumina) and 472.4× (Nanopore). For all software, default parameters were used, except where otherwise noted. Genome annotation via NCBI PGAP (9) predicted 3,144 genes, including 2,993 protein-coding sequences. The average nucleotide identity between M4-SHS-6 and Acidithiobacillus type strains ranged from 70.1% to 80.5%, below the interspecies threshold (95.0% (10)).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Frank JA, Reich CI, Sharma S, Weisbaum JS, Wilson BA, Olsen GJ. 2008. Critical evaluation of two primers commonly used for amplification of bacterial 16S r RNA genes. Appl Environ Microbiol 74:2461–2470. doi:10.1128/AEM.02272-0718296538 PMC 2293150 · doi ↗ · pubmed ↗
- 2Kim OS, Cho YJ, Lee K, Yoon SH, Kim M, Na H, Park SC, Jeon YS, Lee JH, Yi H, Won S, Chun J. 2012. Introducing Ez Taxon-e: a prokaryotic 16S r RNA gene sequence database with phylotypes that represent uncultured species. Int J Syst Evol Microbiol 62:716–721. doi:10.1099/ijs.0.038075-022140171 · doi ↗ · pubmed ↗
- 3Wang X, Lin D, Jing X, Zhu S, Yang J, Chen J. 2018. Complete genome sequence of the highly Mn(II) tolerant Staphylococcus sp. Anti Mn-1 isolated from deep-sea sediment in the Clarion-Clipperton Zone. J Biotechnol 266:34–38. doi:10.1016/j.jbiotec.2017.12.00429223718 · doi ↗ · pubmed ↗
- 4Chen Y, Chen Y, Shi C, Huang Z, Zhang Y, Li S, Li Y, Ye J, Yu C, Li Z, Zhang X, Wang J, Yang H, Fang L, Chen Q. 2018. SOA Pnuke: a Map Reduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7:1–6. doi:10.1093/gigascience/gix 120PMC 578806829220494 · doi ↗ · pubmed ↗
- 5De Coster W, D’Hert S, Schultz DT, Cruts M, Van Broeckhoven C. 2018. Nano Pack: visualizing and processing long-read sequencing data. Bioinformatics 34:2666–2669. doi:10.1093/bioinformatics/bty 14929547981 PMC 6061794 · doi ↗ · pubmed ↗
- 6Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. P Lo S Comput Biol 13:e 1005595. doi:10.1371/journal.pcbi.100559528594827 PMC 5481147 · doi ↗ · pubmed ↗
- 7Krawczyk PS, Lipinski L, Dziembowski A. 2018. Plas Flow: predicting plasmid sequences in metagenomic data using genome signatures. Nucleic Acids Res 46:e 35. doi:10.1093/nar/gkx 132129346586 PMC 5887522 · doi ↗ · pubmed ↗
- 8Molano L-AG, Hirsch P, Hannig M, Müller R, Keller A. 2025. The PLSDB 2025 update: enhanced annotations and improved functionality for comprehensive plasmid research. Nucleic Acids Res 53:D 189–D 196. doi:10.1093/nar/gkae 109539565221 PMC 11701622 · doi ↗ · pubmed ↗
