Draft genome sequence of Dietzia sp. strain CH92 isolated from oil reservoir
Wei Xiang, Quan Zhang, Jianxin Wang, Yanfen Xue, Bo Yu

TL;DR
This paper presents the draft genome of a new Dietzia bacteria strain from an oil well in China.
Contribution
The study provides a new bacterial genome sequence from a high-temperature oil reservoir.
Findings
The draft genome of Dietzia sp. strain CH92 is 3.73 Mb in size.
The genome contains 3,479 predicted protein-coding sequences.
Abstract
We report the draft genome sequence of Dietzia sp. strain CH92, isolated from a high temperature oil well in Baolige oilfield, China. The estimated genome is 3.73 Mb, with 3,479 protein-coding sequences.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —MOST | National Key Research and Development Program of China (NKPs)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Microbial Community Ecology and Physiology · Protist diversity and phylogeny
ANNOUNCEMENT
Dietzia species are aerobic, Gram-positive and distributed in various environments including crude oil reservoirs (1, 2). The organisms can use carbohydrates, organic acids, and hydrocarbons as carbon and energy sources (2). Dietzia sp. strain CH92 was isolated from oil-well production liquid of Baolige oilfield (44°86′N, 115°81′E) in China. The whole-genome sequence will help us understand its function and application potential.
Strain CH92 was isolated by dilution plating method. The oil-well production liquid sample collected from a sampling valve at the pipeline of the well head by fully filling 25 L plastic sampling bottles. The samples were transported to the laboratory and stored overnight at ambient temperature (about 15–20°C). Since the cell biomass is rather low in this water sample, we first concentrated it by using 0.22 µm hollow-fiber filter (MOF-1d, purchased from Motech Co. Ltd., Tianjin, China), and then, the filtrate was used for dilution and plating on the agar medium. The concentrated liquid was diluted with sterilized enrichment medium (ENM), plated on ENM agar, and incubated at 45°C (3). The pure strain was obtained by repeated streaking on the same medium agar plates. Strain CH92 was routinely cultured at 37°C in modified DSMZ 878 medium (4) supplemented with 10 g L^−1^ succinate or 1% (wt/vol) n-hexadecane as carbon sources. Genomic DNA was prepared from an overnight culture in the modified DSMZ 878 medium with the addition of g L^−1^ succinate by using the TIANamp Bacteria DNA kit (TIANGEN, China). The quality and concentration of DNA were determined using a Quantus Fluorometer with the Quant-iT PicoGreen dsDNA Assay Kit (Thermo Fisher Scientific, USA).
DNA samples were sheared into 400–500 bp fragments using a Covaris M220 Focused Acoustic Shearer following manufacture’s protocol. The Illumina PE libraries were prepared from the sheared fragments using the NEXTflex Rapid DNA-Seq Kit (Bioo Scientific, USA) and sequenced in the 150 bp pair-end mode using the Illumina Hiseq × 10 platform at Majorbio Bio-Pharm Technology Inc. (Shanghai, China). The sequencing generated 5,876,037 pairs of raw reads totaling 1,774,563,174 bp, giving approximately 473 × coverage. The reads were quality-trimmed with Trimmomatic v.0.36 (5) and assembled using SOAPdenovo v2 (6). The resultant assembly totaled 3,729,715 bp with 32 contigs, an N50 value of 636,736 bp, and a GC content of 71.40%.
The genomic contigs were analyzed using I-Sanger Cloud Platform from Shanghai Majorbio in March, 2021. Glimmer v3.02 (http://ccb.jhu.edu/software/glimmer/index.shtml) (7) was used for coding DNA sequence (CDS) prediction, tRNA-scan-SE v2.0 (http://trna.ucsc.edu/software/) (8) was used for tRNA prediction, and Barrnap v0.8 (https://github.com/tseemann/barrnap) was used for rRNA prediction. The predicted CDSs were annotated from NR, Swiss-Prot, Pfam, GO, COG, and KEGG database using sequence alignment tools: BLAST + v2.3.0 (9), Diamond v0.8.35 (10), and HMMER v3.1b2 (11). A total of 3,479 CDS genes in addition to 50 tRNAs and 5 rRNAs were annotated for the draft genome sequence. Default parameters were used for all software. The annotation was also uploaded in Fig Share with the link of https://figshare.com/articles/online_resource/The_genome_annotation_with_predicted_functions_for_each_and_every_gene_of_i_Dietzia_i_sp_strain_CH92/24967986.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Olowo-Okere A, Ibrahim YKE, Lo CI, Olayinka BO, Yimagou EK, Yacouba A, Mohammed Y, Nabti LZ, Ragueh AA, Lupande D, Raoult D, Rolain J-M, Diene SM. 2022. Bhargavaea massiliensis sp. nov. and Dietzia massiliensis sp. nov., novel bacteria species isolated from human urine samples in Nigeria. Curr Microbiol 79:18. doi:10.1007/s 00284-022-02838-034905116 · doi ↗ · pubmed ↗
- 2Pukall R. 2014. The family Dietziaceae, p 327–338. In Rosenberg E, De Long EF, Lory S, Stackebrandt E, Thompson F (ed), The Prokaryotes: Actinobacteria, 4th edn. Springer, Berlin.
- 3Xiang W, Liang Y, Hong S, Wang G, You J, Xue YF, Ma YH. 2022. Degradation of long-chain n-alkanes by a novel thermal-tolerant Rhodococcus strain. Arch Microbiol 204:259. doi:10.1007/s 00203-022-02872-335419660 · doi ↗ · pubmed ↗
- 4Degryse E, Glansdorff N, Piérard A. 1978. A comparative analysis of extreme thermophilic bacteria belonging to the genus Thermus. Arch Microbiol 117:189–196. doi:10.1007/BF 00402307678024 · doi ↗ · pubmed ↗
- 5Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi:10.1093/bioinformatics/btu 17024695404 PMC 4103590 · doi ↗ · pubmed ↗
- 6Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, et al.. 2012. SOA Pdenovo 2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1:18. doi:10.1186/2047-217X-1-1823587118 PMC 3626529 · doi ↗ · pubmed ↗
- 7Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with glimmer. Bioinformatics 23:673–679. doi:10.1093/bioinformatics/btm 00917237039 PMC 2387122 · doi ↗ · pubmed ↗
- 8Lowe TM, Eddy SR. 1997. t RN Ascan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi:10.1093/nar/25.5.9559023104 PMC 146525 · doi ↗ · pubmed ↗
