Draft genome sequence of Acerihabitans sp. strain TG2T, isolated from an Arctic tundra soil sample

V. A. Shcherbakova; A. G. Zakharyuk; V. E. Trubitsyn

PMC · DOI:10.1128/mra.00024-24·April 15, 2024

TL;DR

This paper presents the draft genome sequence of a cold-loving bacterium isolated from Arctic tundra soil.

Contribution

The study provides the first draft genome sequence of the psychrophilic bacterium Acerihabitans sp. strain TG2T.

Findings

1. The genome of Acerihabitans sp. strain TG2T is approximately 5.3 Mb in size.
2. The genome was annotated to support further studies on its adaptation to cold environments.

Abstract

Acerihabitans sp. type strain TG2T (VKM B-3773T) is a gram-negative, anaerobic, psychrophilic bacterium that was isolated from a tundra soil sample collected on the Bykovsky Peninsula (Russia). This report describes the generation and annotation of the 5.3 Mb draft genome sequence of Acerihabitans sp. strain TG2T.

Funding

Russian Science Foundation (RSF)

Keywords

genome analysis · Arctica · psychrophiles

Taxonomy

Topics: Microbial Community Ecology and Physiology · Polar Research and Ecology · Genomics and Phylogenetic Studies

Full text

ANNOUNCEMENT

An anaerobic psychrophilic bacterium, Acerihabitans sp. type strain TG2^T^ (VKM B-3773^T^), was isolated from the upper horizon of a tundra soil sample (depth 8–17 cm) collected in 2021 near the western part of the Ivashkina lagoon on the Bykovsky Peninsula, Russia (71.72°N, 129.28°E). The strain was isolated using the Hungate anaerobic technique (1) by serial dilution in medium (2) with xylose as the substrate. Colonies were obtained by the “roll-tube” method using 2% (wt/vol) agar medium. Cells of the strain were gram-negative, motile, non-sporulating short rods. Strain TG2^T^ grew at temperatures from 0 to 25°C (optimum 8°C) and at pH 6.0–8.0 (optimum pH 7.5).

The strain was grown from a single colony. Colonies for DNA isolation and sequencing were grown on a solid medium at 8°C and pH 7.5 for 4 weeks. The Hungate tube with colonies was transferred to the BioSpark Company (Troitsk, Russia) for genomic DNA preparation and sequencing. Genomic DNA was isolated with the FastDNA spin kit (MP Biomedicals, USA) by the column method with binding to a silica matrix. Libraries were prepared using KAPA HyperPlus kits (Kapa Biosystems, USA) in accordance with the manufacturer’s recommendations. Sequencing was performed on the Illumina NovaSeq 6000 platform, yielding a paired-end library with a total of 2,786,968 read pairs and a read length of 2 × 150 bp. The KBase (3) online platform with associated tools was used for genome assembly and analysis. Read quality was assessed using FastQC v.0.12.1 (4). Reads were processed in Trimmomatic v.0.36 (5) with the parameters “HEADCROP = 15, MINLEN = 50,” and adapters from the TruSeq3-PE-2 set were removed. The best assembly was obtained using the Unicycler v.0.4.8 assembler (6). Genome completeness and contamination were evaluated using CheckM v.1.0.18 (7). Taxonomy was assigned using GTDB-Tk v.1.7.0 (8) and the TYGS server (9). OrthoANI was calculated using the online service https://www.ezbiocloud.net/tools/ani. Alignment of reads to the genome and determination of coverage and the number of aligned reads were carried out using Bowtie2 v.2.3.2 (10).
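The effect of the two quoted trimming parameters can be illustrated with a minimal sketch. It assumes the usual meaning of these Trimmomatic steps (HEADCROP removes a fixed number of bases from the start of each read, MINLEN then discards reads shorter than the given length) and deliberately ignores adapter clipping and read pairing. The function name and the toy reads are hypothetical; the read-pair count and read length are taken from the text above.

```python
def head_crop_then_min_len(read, head_crop=15, min_len=50):
    """Drop the first `head_crop` bases, then discard the read if it is too short.

    Returns the trimmed read, or None when fewer than `min_len` bases remain.
    """
    trimmed = read[head_crop:]
    return trimmed if len(trimmed) >= min_len else None


# Toy reads (hypothetical): a full-length 150-bp read and a 60-bp read.
full_read = "A" * 150
short_read = "A" * 60

kept = head_crop_then_min_len(full_read)      # 135 bases remain, read is kept
dropped = head_crop_then_min_len(short_read)  # 45 bases remain, read is discarded

# Sequencing yield implied by the reported library size.
READ_PAIRS = 2_786_968
READ_LENGTH = 150
raw_bases = READ_PAIRS * 2 * READ_LENGTH                   # before trimming
max_bases_after_crop = READ_PAIRS * 2 * (READ_LENGTH - 15)  # upper bound after HEADCROP

print(len(kept), dropped, raw_bases, max_bases_after_crop)
```

The second figure is only an upper bound, since adapter removal and the length filter can shorten or remove further reads.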

The genome was annotated using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (11). The assembly contained 108 contigs with a maximum length of 420,954 bp. The assembled draft genome sequence length was 5,292,910 bp. The G + C content was 51.1%, the scaffold N50 value was 150.9 kb, and the L50 value was 12. Read alignment showed that 68.33% of the reads mapped to the genome, with a coverage of 106.34 ± 40.32. Genome completeness was 100%, and contamination was 0.96%.
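For readers unfamiliar with the two contiguity statistics reported here, the following sketch shows how N50 and L50 are conventionally derived from a list of contig lengths: contigs are sorted from longest to shortest and accumulated until they cover at least half of the assembly. The contig lengths below are toy values, not those of the TG2^T^ assembly, and the function name is hypothetical.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of contig or scaffold lengths.

    N50 is the length of the contig at which the running total, taken from
    the longest contig downward, first reaches half of the assembly size.
    L50 is the number of contigs needed to reach that point.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise ValueError("no contig lengths given")


# Toy assembly (hypothetical lengths in bp), total 1,100 bp.
toy_contigs = [100, 400, 50, 300, 200, 50]
n50, l50 = n50_l50(toy_contigs)
print(n50, l50)  # 300 2
```

Read this way, the reported values mean that the 12 longest sequences of the assembly together span at least half of its 5,292,910 bp, and the shortest of those 12 is about 150.9 kb long.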

The total number of genes was 4,950, including 4,734 protein-coding sequences, 61 tRNA genes, and 2 rRNA genes (one 16S and one 23S). The closest related type strain, according to TYGS, was Acerihabitans arboris SAP-6^T^ (assembly GCA_010131535.1, WGS WUBS01), isolated from tree sap (Jeju, South Korea) (12), with a DNA-DNA hybridization (DDH) value of 22.6% and an orthoANI value of 78.2%. The 16S rRNA gene was extracted from the genome. The similarity between the 16S rRNA gene sequences of strains TG2^T^ (1,551 bp) and SAP-6^T^ (MN737198.1) was 98.2%. Taken together with previous data, this may indicate that the studied strain represents a new species of the genus Acerihabitans.
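The reasoning behind the new-species suggestion can be sketched as a comparison of the three reported values against species-delineation cutoffs. The cutoffs used here (70% for DDH, 95% for ANI, 98.7% for 16S rRNA gene similarity) are commonly cited conventions and are an assumption of this sketch; the announcement itself does not state which thresholds the authors applied. All names in the code are hypothetical.

```python
# Commonly cited species cutoffs (assumed here, not given in the announcement).
SPECIES_CUTOFFS = {
    "DDH": 70.0,       # DNA-DNA hybridization, %
    "orthoANI": 95.0,  # average nucleotide identity, % (lower end of the 95-96% range)
    "16S": 98.7,       # 16S rRNA gene similarity, %
}

# Values reported for TG2T versus Acerihabitans arboris SAP-6T.
reported = {"DDH": 22.6, "orthoANI": 78.2, "16S": 98.2}


def below_cutoffs(values, cutoffs=SPECIES_CUTOFFS):
    """Map each metric to True when its value falls below the species cutoff."""
    return {metric: values[metric] < cutoffs[metric] for metric in cutoffs}


result = below_cutoffs(reported)
print(result)
print("consistent with a separate species:", all(result.values()))
```

Under these assumed cutoffs all three metrics fall below the species boundary, which agrees with the cautious wording of the announcement; a formal species description would still require the additional phenotypic and chemotaxonomic evidence that a genome announcement does not cover.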

Bibliography

1. Hungate RE. 1969. A roll tube method for cultivation of strict anaerobes, p 117–132. In Norris R, Ribbons RW (ed), Methods in microbiology. Vol. 13. Academic Press, New York.
2. Zakharyuk AG, Kopitsyn DS, Suzina NE, Shcherbakova VA. 2023. Pelosinus baikalensis sp. nov., an iron-reducing bacterium isolated from a cold freshwater lake. Microbiology 92:137–145. doi:10.1134/S0026261722602913
3. Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, Dehal P, Ware D, Perez F, Canon S, et al. 2018. KBase: the United States Department of Energy Systems Biology Knowledgebase. Nat Biotechnol 36:566–569. doi:10.1038/nbt.4163. PMID:29979655; PMCID:PMC6870991
4. Andrews S. 2010. FastQC: a quality control tool for high throughput sequence data. Available from: https://github.com/s-andrews/FastQC
5. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi:10.1093/bioinformatics/btu170. PMID:24695404; PMCID:PMC4103590
6. Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi:10.1371/journal.pcbi.1005595. PMID:28594827; PMCID:PMC5481147
7. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi:10.1101/gr.186072.114. PMID:25977477; PMCID:PMC4484387
8. Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. 2019. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 36:1925–1927. doi:10.1093/bioinformatics/btz848. PMID:31730192; PMCID:PMC7703759