Draft genome sequence of rhamnolipid-producing bacteria Thermoanaerobacter thermocopriae strain CM-CNRG TB177 isolated from an oil reservoir in Mexico
Veronica Segovia, Pedro Martínez, Regina Hernández-Gama

TL;DR
This paper presents the draft genome sequence of a heat-loving bacteria that produces rhamnolipids from glucose, isolated from an oil reservoir in Mexico.
Contribution
The novel contribution is the first draft genome sequence of Thermoanaerobacter thermocopriae strain CM-CNRG TB177, highlighting its rhamnolipid production capability.
Findings
The genome size is approximately 2.5 million base pairs.
The genome encodes 2,550 predicted genes.
The bacterium is thermophilic, anaerobic, and produces rhamnolipids using glucose.
Abstract
The draft genome of Thermoanaerobacter thermocopriae CM-CNRG TB177 isolated from an oil reservoir in Mexico was determined and annotated. The organism is a thermophilic and strict anaerobe bacterium that produces rhamnolipids, using glucose as a carbon source. The predicted genome size is 2,496,169 bp and 2,550 genes.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —CONAHCYT
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Enhanced Oil Recovery Techniques · Microbial bioremediation and biosurfactants
ANNOUNCEMENT
The genus Thermoanaerobacter includes Gram-positive, thermophilic bacteria (1). Some species belonging to this genus have been studied for the production of biotechnologically important metabolites such as ethanol and enzymes, among others (2–4). The draft genome sequence presented here was particularly studied for rhamnolipid production (5). In addition to its biosurfactant production, the Thermoanaerobacter thermocopriae CM-CNRG TB177 strain is an extremophile bacterium, capable of thriving in temperatures ranging from 50°C to 80°C and in salt concentrations of up to 30 g/L of NaCl. These characteristics make this microorganism particularly valuable for industrial applications such as microbial enhanced oil recovery and mining.
The T. thermocopriae isolate was previously obtained from formation water at the Chicontepec oil field in Mexico (6). It was subsequently cultured in basal medium with glucose as the carbon source, as previously reported (7). Genomic DNA was extracted using the commercial ZymoBIOMICS DNA Miniprep Kit (Zymo Research, USA), and the DNA quality was verified using a NanoDrop One (Thermo Scientific, USA).
The DNA was sequenced twice. First, sequencing is performed using Illumina NextSeq 500 with the TruSeq DNA PCR-Free Sample Prep Kit in a 2 × 75 cycle configuration (Illumina, USA) by the Massive Sequencing Unit of the UNAM Institute of Biotechnology, constructing genomic DNA libraries with 350-bp fragments. The second sequencing was performed using Illumina MiSeq device in a 2 × 300 run format by the Genomic Services Laboratory of the National Laboratory of Genomics for Biodiversity (LANGEBIO), using the Illumina TruSeq DNA Nano Kit to construct the genomic DNA libraries using 550-bp fragments.
From the first sequencing, 28,471,116 paired raw sequences of 75 bp were obtained, and from the second sequencing, 1,341,482 paired raw sequences of 300 bp were obtained. The initial quality of the sequences was analyzed using the FastQC v0.11.9 program (8). Subsequently, the sequences were subjected to a cleaning process with the TrimmomaticPE v0.39 program (9). For the first sequencing data, the parameters used were a sliding window of 4:20, a head crop of one, trailing of two, and minimum length of 60. For the second sequencing data, the parameters used were a sliding window of 5:20 and the Ilumina clip to eliminate adaptors.
After cleaning 12,737,254 paired sequences of approximately 75 bp and 11,255,044 paired sequences of length 200–299 bp were obtained, with these reads, a coverage of 1,000 was estimated. From these sequences, genome hybrid assembly was performed using SPAdes version 3.15.3 (10), using only the assembly without sequence editing. The assembly generated 294 contigs, with a sequence length of 2,496,169, an N50 value of 94,187 bp; the longest contig was 252,232 bp, with a guanine and cytosine content of 34.68%. All tools were run with default parameters unless otherwise specified.
Genome annotation was performed using the Prokaryotic Genome Annotation Pipeline from the National Center for Biotechnology Information (11) version 6.7; as a result, 2,550 genes were obtained in the genome, including four complete rRNAs, 57 tRNAs, and four clustered regularly interspaced short palindromic repeats arrays. A genomic analysis was conducted using the GTDB-Tk toolkit (12), resulting in the assignment of T. thermocopriae.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Schleifer KH. 2009. Phylum XIII. Firmicutes Gibbons and Murray 1978, p 387–68489. In De Vos P (ed), Bergey’s manual of systematic bacteriology. Springer, New York.
- 2Musa MM, Vieille C, Phillips RS. 2021. Secondary alcohol dehydrogenases from Thermoanaerobacter pseudoethanolicus and Thermoanaerobacter brockii as robust catalysts. Chembiochem 22:1884–1893. doi:10.1002/cbic.20210004333594812 · doi ↗ · pubmed ↗
- 3Tian LF, Gao H, Yang S, Liu YP, Li M, Xu W, Yan XX. 2023. Structure and function of extreme TLS DNA polymerase TTE Dbh from Thermoanaerobacter tengcongensis. Int J Biol Macromol 253:126770. doi:10.1016/j.ijbiomac.2023.12677037683741 · doi ↗ · pubmed ↗
- 4Burger Y, Schwarz FM, Müller V. 2022. Formate-driven H 2 production by whole cells of Thermoanaerobacter kivui. Biotechnol Biofuels Bioprod 15:48. doi:10.1186/s 13068-022-02147-535545791 PMC 9097184 · doi ↗ · pubmed ↗
- 5Segovia V, Reyes A, Rivera G, Vázquez P, Velazquez G, Paz-González A, Hernández-Gama R. 2021. Production of rhamnolipids by the Thermoanaerobacter sp. CM-CNRG TB 177 strain isolated from an oil well in Mexico. Appl Microbiol Biotechnol 105:5833–5844. doi:10.1007/s 00253-021-11468-834396489 · doi ↗ · pubmed ↗
- 6Hernández R. 2014. Analysis of the microbial diversity of a Chicontepec well and the conditions in which it favors the emulsification of crude oil, Ph D thesis. Instituto Politecnico Nacional. https://tesis.ipn.mx/bitstream/handle/123456789/19572/REGINA.pdf?sequence=1&is Allowed=y.
- 7Feng X, Mouttaki H, Lin L, Huang R, Wu B, Hemme CL, He Z, Zhang B, Hicks LM, Xu J, Zhou J, Tang YJ. 2009. Characterization of the central metabolic pathways in Thermoanaerobacter sp. strain X 514 via isotopomer-assisted metabolite analysis. Appl Environ Microbiol 75:5001–5008. doi:10.1128/AEM.00715-0919525270 PMC 2725511 · doi ↗ · pubmed ↗
- 8Andrews S. 2010. Fast QC: a quality control tool for high throughput sequence data. Available from: http://www.bioinformatics.babraham.ac.uk/projects/fastqc
