Complete genome sequence of Idiomarina sp. PL1-037 isolated from the pink hypersaline Pearse Lakes, Rottnest Island, Western Australia

Crystal E. Young; Hussain Alattas; Colin Scott; Daniel V. Murphy; Ravi Tiwari; Wayne G. Reeve

PMC · DOI:10.1128/mra.00157-24·July 5, 2024

Complete genome sequence of Idiomarina sp. PL1-037 isolated from the pink hypersaline Pearse Lakes, Rottnest Island, Western Australia

Crystal E. Young, Hussain Alattas, Colin Scott, Daniel V. Murphy, Ravi Tiwari, Wayne G. Reeve

PDF

Open Access

TL;DR

This paper presents the complete genome sequence of a new Idiomarina species isolated from a hypersaline lake in Western Australia.

Contribution

The study provides the first complete genome sequence of Idiomarina sp. PL1-037, an extremophile from the Pearse Lakes.

Findings

01

The genome consists of a single chromosome with 2,804,934 base pairs.

02

The GC content of the genome is 47.1%.

03

The isolate offers insights into culturable extremophiles from the Pearse Lakes microbiome.

Abstract

Idiomarina sp. PL1-037 was isolated from Pearse Lakes, Rottnest Island, Western Australia. The sequenced completed genome for PL1-037 is composed of a single chromosome (2,804,934 bp) with a GC content of 47.1%. Isolation of Idiomarina sp. PL1-037 provides insights about culturable extremophiles from the Pearse lakes microbiome.

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species2

Idiomarina sp. PL1-037 Idiomarina sp.(species)

Figures1

Click any figure to enlarge with its caption.

Fig 1 — Dendro-unweighted pair group method with arithmetic mean (UPGMA) tree displaying the relatedness of Idiomarina sp. PL1-037 to related species based on ANIb values. The ANIb values were generated using JSpeciesWS (10) and imported into DendroUPGMA (13), and the tree was constructed using a similarity matrix (Pearson’s correlation coefficient) within the algorithm (14). The tree produced was exported in Newick format and imported into TvBOT ( https://www.chiplot.online/tvbot.html) (15). The superscript T indicates type strains, and Escherichia coli str. K-12 substr. MG1655 K-12 was used as an outgroup (1).

Tables1

TABLE 1. General feature of the genome of Idiomarina sp. PL1-037 (accession: CP139873) from Prokaryotic Genome Annotation Pipeline

	Data from:
Feature	GenBank annotation
Total no. of genes	2,634
No. of protein-coding sequences	2,551
No. of rRNA operons	4
5S	4
16S	4
23S	4
No. of tRNA genes	56
No. of other RNA genes	4
Locus tag prefix	U0358_

Keywords

halophilesextremophilesenvironmental microbiology

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenomics and Phylogenetic Studies · Microbial Community Ecology and Physiology · Oral microbiology and periodontitis research

Full text

ANNOUNCEMENT

Idiomarina are gram-negative, rod-shaped, euryhalophiles found in a range of hypersaline environments (1). Rottnest Island (~30.4 km offshore of Perth, Western Australia) hosts athalassic soda hypersaline lakes, including Pearse Lakes (2). Here, we report the complete genome sequence of a Idiomarina sp. PL1-037, isolated from Pearse Lakes in May 2022, when the lakes had a pH of 8.0 ± 0.01 and salinity of 20.1% ± 1.44%.

Water samples were collected (S 32° 0′ 22.281″E 115° 30′ 44.484″) and stored at 4°C before cultivation. Water samples (1,500 mL) were centrifuged at 4,500 g for 10 minutes and streaked from the cell pellet on modified lysogeny media containing (per liter): 10.0 g bacto-tryptone, 5.0 g bacto-yeast extract, 150.0 g NaCl, 15.0 g Agar, and 2.4 g HEPES (pH 7.8) (3). Single colonies were re-streaked until pure cultures were obtained and cryopreserved (15% glycerol, −80°C). Genomic DNA (gDNA) was extracted from stationary phase culture using CTAB (2%) (4) and sequenced using Oxford Nanopore Technology (ONT). ONT library was prepared according to the ONT ligation native barcoding gDNA library protocol (SQK-NBD114.24) (https://community.nanoporetech.com/docs) with a FLO-PRO114M flow cell (R10.4.1) on the PromethION 2 platform. Guppy [v6.5.7; (5)] was used for base calling-sequenced data with a read-pass-filter quality score cutoff value of 9 and minimum length of 1,500 bp. A total of 24,613 reads were generated (175,120,911 bp), providing an average coverage of 61× and an N 50 value of 16,747 bp determined with NanoStat [v1.6.0; (6)]. ONT long reads were assembled using the Flye [v2.9.2; (7)] using default parameters with nine iterations, resulting in a single circular chromosome (2,804,934 bp) with a GC content of 47.1%. A quantitative genome assembly assessment was performed using BUSCO [v5.4.6; (8)] and CheckM [v1.2.2; (9)], providing a completeness score of 96.8% and 97.93% (0.79% contamination), respectively. Average nucleotide identity analysis using BLAST (ANIb) showed below to cut-off value (>95%) with the closest relative (91.7%) Idiomarina rhizosphaerae M1R2S28^T^ (GCF_024159085.1) (10 – 12) (Fig. 1).

Dendro-unweighted pair group method with arithmetic mean (UPGMA) tree displaying the relatedness of Idiomarina sp. PL1-037 to related species based on ANIb values. The ANIb values were generated using JSpeciesWS (10) and imported into DendroUPGMA (13), and the tree was constructed using a similarity matrix (Pearson’s correlation coefficient) within the algorithm (14). The tree produced was exported in Newick format and imported into TvBOT ( https://www.chiplot.online/tvbot.html) (15). The superscript T indicates type strains, and Escherichia coli str. K-12 substr. MG1655 K-12 was used as an outgroup (1).

Gene calling and annotation of the generated sequence were performed using the NCBI Prokaryotic Genome Annotation Pipeline (Table 1) [v6.6; (16)]. Isolating Idiomarina sp. PL1-037 and determining its complete genome will help advance our understanding of microbial life in Pearse Lakes’ extreme environment (17).

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Taborda M , Antunes A , Tiago I , Veríssimo A , Nobre MF , da Costa MS . 2009. Description of Idiomarina insulisalsae sp. nov., isolated from the soil of a sea salt evaporation pond, proposal to transfer the species of the genus Pseudidiomarina to the genus Idiomarina and emended description of the genus Idiomarina. Syst Appl Microbiol 32:371–378. doi:10.1016/j.syapm.2009.06.005 19625151 · doi ↗ · pubmed ↗
2Playford PE , Leech R , Kendrick GW . 1977. Geology and hydrology of Rottnest Island. Vol. 6. Geological survey of Western Australia.
3Lennox ES . 1955. Transduction of linked genetic characters of the host by bacteriophage P 1. Virology 1:190–206. doi:10.1016/0042-6822(55)90016-7 13267987 · doi ↗ · pubmed ↗
4William S , Feil H , Copeland A . 2012. Bacterial genomic DNA isolation using CTAB. Sigma 50:6876.
5Wick RR , Judd LM , Holt KE . 2019. Performance of neural network basecalling tools for Oxford nanopore sequencing. Genome Biol 20:129. doi:10.1186/s 13059-019-1727-y 31234903 PMC 6591954 · doi ↗ · pubmed ↗
6De Coster W , D’Hert S , Schultz DT , Cruts M , Van Broeckhoven C . 2018. Nano Pack: visualizing and processing long-read sequencing data. Bioinformatics 34:2666–2669. doi:10.1093/bioinformatics/bty 149 29547981 PMC 6061794 · doi ↗ · pubmed ↗
7Kolmogorov M , Yuan J , Lin Y , Pevzner PA . 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi:10.1038/s 41587-019-0072-8 30936562 · doi ↗ · pubmed ↗
8Manni M , Berkeley MR , Seppey M , Simão FA , Zdobnov EM . 2021. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol 38:4647–4654. doi:10.1093/molbev/msab 199 34320186 PMC 8476166 · doi ↗ · pubmed ↗