Genome sequence of Providencia rettgeri ZAPR22R, isolated from calla lily in China
Yin Jiang, Zhen Zeng, Zengxian Wang, Yi Wang, Rongxin Gou, Xiaorong Huang, Yushan Lu, Guojun Zhang, Zunzheng Wei

TL;DR
This paper reports the full genome sequence of a Providencia rettgeri strain found in infected calla lilies in China.
Contribution
The study provides the complete genome sequence of a new Providencia rettgeri strain isolated from a plant.
Findings
The genome is a single chromosome of 4,528,722 base pairs.
The G+C content of the genome is 41.1%.
Abstract
Here, we present the complete genome sequence of Providencia rettgeri strain ZAPR22R, isolated from the petiole and tuber of calla lily (Zantedeschia hybrida), infected with soft rot. The genome consists of a single chromosome (4,528,722 bp) with a G+C content of 41.1%.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
| Characteristic | Result |
|---|---|
| Sequencing | |
| Raw reads | |
| No. of reads | 49,033 |
| Size (bp) | 418,203,977 |
| Clean reads | |
| No. of reads | 47,446 |
| Size (bp) | 409,703,740 |
| N50 (bp) | 9,216 |
| Average read length (bp) | 8,635 |
| Genome | |
| Assembly size (bp) | 4,528,722 |
| Contig | 1 |
| G + C content (%) | 41.1 |
| No. of coding genes | 4,084 |
| Total gene size (bp) | 3,865,407 |
| Average gene length (bp) | 946 |
| Total repetitive sequence length (bp) | 13,203 |
| Repetitive sequence content (%) | 0.29 |
| rRNA | 8,7,7 (5S, 16S, 23S) |
| tRNA | 82 |
| CRISPR | 4 |
| Genomic island | 4 |
| Prophage | 4 |
| Gene cluster | 2 |
| Protein subcellular localization | |
| Signal peptide | 392 |
| Transmembrane protein | 1,023 |
| Secreted protein | 311 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Plant Pathogens and Fungal Diseases · Plant and Fungal Interactions Research
ANNOUNCEMENT
Providencia rettgeri, an opportunistic pathogen, has been described to be implicated in various diseases such as urinary tract infections, diarrhea, and meningitis (1). It can be isolated from diseased tissues, and genomic analysis indicates that P. rettgeri is resistant to several antibiotics (2). Additionally, P. rettgeri has been proven to be one of the pathogens causing soft rot (3), which causes the decay of plant organs or tissues, and is fatal and difficult to control, resulting in substantial economic losses. The bacterial genome was obtained on the basis of our previous research.
P. rettgeri ZAPR22R was primordially isolated from the petiole and tuber of soft rot calla lily in Yanqing District, Beijing, China, in August 2022, in our previous work (3). Petiole and tuber samples were collected, surface-sterilized with 75% alcohol, and then rinsed with sterile distilled water (SDW) multiple times. The clean samples were then ground into a homogenate, plated onto Luria-Bertani (LB) agar, and incubated at 28°C for 12 h. Single colonies were selected for purification, and the purified single colony was propagated for DNA (FastPure Bacteria DNA Isolation Mini Kit, Nanjing Vazyme Biotech Co., Ltd.) extraction and genomic sequencing.
Library preparation and genome sequencing were performed at Biomarker Technologies, according to the protocol provided by PacBio. PacBio sequencing technology uses SMRT chip as the sequencing carrier. In the nanopore inside the SMRT chip, DNA polymerase and template are combined and four colors are fluorescently labeled with four bases (deoxyribonucleoside triphosphates [dNTPs]). In the base pairing stage, different bases are added, different light will be emitted, and the type of base can be judged according to the wavelength and peak of light. The DNA sample was sheared using g-TUBE, and then damage repair and terminal repair were performed. The dumbbell joints were connected, and the exonuclease digestion was carried out.The target fragments were screened using BluePippin, and then the sequencing libraries were obtained. Shorter reads (<2 kb) were filtered after sequencing using PacBio HIFI mode. Fastuniq v.1.1 (-t q), trimmomatic v.0.33 (LEADING:3 TRAILING:3 SLIDINGWINDOW:50:20 MINLEN:100), and filtlong v.0.2.0 were used for quality control. A total of 47,446 reads, with an average length of 8,635 bp, were obtained. The filtered subreads were assembled, followed by circularization (4). Spades v.3.6.2 (-m 300 -k 21,33,55,77,99,127 --cov-cutoff auto), canu v.1.5 (useGrid=false overlapper=mhap utgRe Align=true), wtdbg v.2.5, flye v.2.8.2, hifiasm v.0.12, racon v.1.4.0 (-u -t 4), and pilon v.1.22 (--mindepth 1 --changes --fix all) were used for assembly. Circlator v.1.5.5 (minimus2 --no_pre_merge) was used for ring judgment. Default parameters were used for all software unless otherwise specified. The genome of P. rettgeri ZAPR22R presumably consists of a single chromosome of 4,528,722 bp and exhibits a G+C content of 41.1% (Table 1). The topology of the contig was predicted to be circular.
The NCBI Prokaryotic Genome Annotation Pipeline v.5.3 was used for annotation of public assembly (5), while coding genes were predicted using Prodigal v.2.6.3 for analyses reported here (6). Repeat sequences were searched by RepeatMasker v.4.0.5 (7). Transfer RNA (tRNA) genes were predicted with tRNAscan-SE v.2.0. Ribosome RNA genes were predicted with Infernal v.1.1.3 (8–10). CRT v.1.2, CRISPR, genomic island, prophage, and gene cluster were separately predicted by IslandPath-DIMOB v.0.2, PhiSpy v.2.3, and antiSMASH v.5.0.0 (11–14). The transmembrane proteins were filtered by TMHMM. The secretory proteins were detected by SignalP; after transmembrane proteins were filtered by TMHMM, the candidate secretory proteins could be obtained (15, 16). Analysis results are listed in Table 1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Marquez-Ortiz RA, Haggerty L, Sim EM, Duarte C, Castro-Cardozo BE, Beltran M, Saavedra S, Vanegas N, Escobar-Perez J, Petty NK. 2017. First complete Providencia rettgeri genome sequence, the NDM-1-producing clinical strain RB 151. Genome Announc 5:e 01472-16. doi:10.1128/genome A.01472-1628104655 PMC 5255920 · doi ↗ · pubmed ↗
- 2Udaondo Z, Abram KZ, Gulley T, Garner K, Shray J, Whisnant M, Harris-Spotts A, Crawford M, Kothari A, Wongsurawat T, Moon SH, Huang E, Jun SR. 2022. Complete genome sequence of a non-carbapenemase-producing carbapenem-resistant Providencia rettgeri strain isolated from a clinical urine sample in Arkansas. Microbiol Resour Announc 11:e 0047422. doi:10.1128/mra.00474-2235894621 PMC 9387286 · doi ↗ · pubmed ↗
- 3Wang X, Wang D, Pun MK, Gou R, Liang H, Wei Z. 2023. First report of Providencia rettgeri causing soft rot on calla lily in China . Plant Dis 107:2837. doi:10.1094/PDIS-02-23-0374-PDN · doi ↗
- 4Cheng H, Concepcion GT, Feng X, Zhang H, Li H. 2021. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods 18:170–175. doi:10.1038/s 41592-020-01056-533526886 PMC 7961889 · doi ↗ · pubmed ↗
- 5Tatusova T, Di Cuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614–6624. doi:10.1093/nar/gkw 56927342282 PMC 5001611 · doi ↗ · pubmed ↗
- 6Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. doi:10.1186/1471-2105-11-11920211023 PMC 2848648 · doi ↗ · pubmed ↗
- 7Tarailo-Graovac M, Chen N. 2009. Using Repeat Masker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics Chapter 4:4. doi:10.1002/0471250953.bi 0410 s 2519274634 · doi ↗ · pubmed ↗
- 8Kalvari I, Argasinska J, Quinones-Olvera N, Nawrocki EP, Rivas E, Eddy SR, Bateman A, Finn RD, Petrov AI. 2018. Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families. Nucleic Acids Res 46:D 335–D 342. doi:10.1093/nar/gkx 103829112718 PMC 5753348 · doi ↗ · pubmed ↗
