Genome Sequence of Microbacterium foliorum Phage KingKamren
Alyssa Gleichsner, Kamren Harden, Kathryn Salphine, Jill Sandel, Mikaela Bova, Bella Denapole, Anastasia Godlewski, Samara Acevedo, Alexander Galarneau, Mazon Zales, Gustavia Twumasi, Sophia Voss, Faith Haynes, Rebekah Abdul-Wahhab, Banfy Bu, Abigail Favro, Amen Zergaw

TL;DR
Scientists discovered and sequenced a new bacteriophage called KingKamren, which infects a soil bacterium called Microbacterium foliorum.
Contribution
The paper presents the genome sequence of a newly discovered cluster EK bacteriophage, KingKamren, isolated from a soil sample.
Findings
KingKamren's genome is 54,721 base pairs long and contains 51 putative genes.
Seventeen of the 51 genes have predicted functions identified in the study.
Abstract
We report the discovery and genome sequence of a cluster EK bacteriophage, KingKamren, isolated from a soil sample collected in Plattsburgh, New York using the bacteria Microbacterium foliorum , B-24224. Its 54,721 bp genome contains 51 putative genes, 17 of which have predicted functions.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Machine Learning in Bioinformatics
Description
Bacteriophages are some of the most genetically diverse entities known (Pope et al., 2015). Understanding this diversity has implications for both ecological and health applications (Milhaven et al., 2023; Strathdee et al., 2023). Here we present the genome of a new bacteriophage, KingKamren, which was collected in 2023 from Plattsburgh NY (44.69179 N, 73.46351 W) using the bacterial host Microbacterium foliorum , B-24224.
Following standard procedures (Zorawik et al., 2024), approximately 15 cm ^3^ of soil was suspended in 35 mL PYCa medium and shaken at 37°C at 250 rpm for 2 hours followed by centrifugation at 2,000 g and vacuum filtration (0.22 μm filter) of the supernatant. This filtrate was inoculated with M. foliorum and incubated at 30°C for 5 days at 250 rpm. An aliquot was spun at 14,000 g, filtered, and plated in PYCa top agar containing M. foliorum . After 48 hours, KingKamren formed small, turbid plaques with an average size of 1.17 mm (∓ 0.10 SE) in diameter, as determined through measurements using ImageJ (Schnieder et al., 2012) ( Fig. 1A ) and was purified through two rounds of plating.
DNA was isolated from a lysate (Wizard DNA Clean-up Kit, Promega), prepped for sequencing (NEBNext Ultrall FS Kit), sequenced (Illumina sequencing, v3 reagents) and assembled as described by Russell (Russell, 2018). Sequencing resulted in 2,929,352 single-end 150-bp reads with 4,991-fold coverage. The genome was assembled using Newbler v2.9 (Margulies et al,. 2005) and checked for completeness and genome termini using Consed v29.0 (Gordon et al., 1998). Default settings were used unless otherwise noted. This resulted in a genome 54,721 bp in length with 203-bp direct terminal repeat ends and a GC content of 57.5%.
Using standard procedures (Pope et al., 2017), the software DNA Master v5.23.6 (http://cobamide2.bio.pitt.edu), PECAAN (https://discover.kbrinsgd.org), Genemark v2.5p (Lukashin and Borodovsky, 1998), and Glimmer v3.02 (Delcher et al., 1999) were used to predict 53 protein-encoding genes. Start sites were determined using Starterator v485 (https://seaphages.org/software/#Starterator) and Blastp v2.13.0 (Altschul et al., 1990) alignments against the Actinobacteriophage protein (Russell and Hatful, 2017) and NCBI non-redundant protein sequences databases (https://blast.ncbi.nlm.nih.gov). No strong evidence for tRNAs was found using Aragorn v1.2.41 (Laslett and Canback, 2004) and tRNAscan-SE v2.0 (Lowe and Eddy, 1997). A total of 14 genes were assigned putative functions using BLASTp v2.13.0 (Altschul et al., 1990), Phamerator (Cresawn et al., 2011), and HHpred (searching against PDB_mmCIF70, SCOPe70, Pfam-A, and NCBI_Conserved_Domains databases) (Söding et al., 2005). deepTMHMM v1.0.24 (Krogh et al., 2001) and SOSUI (Hirokawa et al. 1998) detected an additional 3 genes as membrane proteins. All software used default settings. The annotation is presented in Fig. 1B
KingKamren was assigned to the EK cluster using the GCS tool (Hirokawa et al., 1998), based on having a gene content similarity (GCS) of at least 35% to other EK bacteriophages in the Actinobacteriophage database. The EK cluster currently contains 56 members. The majority of EK phages are placed into one of two subclusters, EK1 or EK2, but KingKamren is one of three phages (to date) that are not sub-classified, as its GCS is similar to both EK1 and EK2 phages (Figure 1b). This small subset of EK phages share 4 genes (KingKamren’s genes 12, 26, 35, and 47 – Fig. 1C ) of unknown function that are unique to this group and do not share significant sequence similarity to any other actinobacteriophage in the database (phagesdb.org). KingKamren also has a 13,452 bp gene (gene 28, Fig. 1B ) which constitutes 24.6% of its entire genome and encodes a 4,483 amino acid protein of unknown function. This feature is found across all members in the EK phage cluster and represents one of the largest genes in actinobacteriophages (Jacobs-Sera et al., 2020).
Nucleotide sequence accession numbers
KingKamren is available at GenBank with Accession No. XPP978791 and Sequence Read Archive (SRA) No. SRX25029057.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Altschul Stephen F. Gish Warren Miller Webb Myers Eugene W. Lipman David J. 1990101 Basic local alignment search tool Journal of Molecular Biology 21530022-283640341010.1016/s 0022-2836(05)80360-22231712 · doi ↗ · pubmed ↗
- 2Cresawn SG, Bogel M, Day N, Jacobs-Sera D, Hendrix RW, Hatfull GF. 2011. Phamerator: a bioinformatic tool for comparative bacteriophage genomics. BMC Bioinformatics 12:395–395. https://doi.org/10.1186/1471-2105-12-39510.1186/1471-2105-12-395PMC 323361221991981 · doi ↗ · pubmed ↗
- 3Delcher A. 1999121 Improved microbial gene identification with GLIMMER Nucleic Acids Research 27231362-49624636464110.1093/nar/27.23.463610556321 PMC 148753 · doi ↗ · pubmed ↗
- 4Hirokawa T, Boon-Chieng S, Mitaku S. 1998. SOSUI: classification and secondary structure prediction system for membrane proteins. Bioinformatics 14:378–379. https://doi.org/10.1093/bioinformatics/14.4.37810.1093/bioinformatics/14.4.3789632836 · doi ↗ · pubmed ↗
- 5Gordon David Abajian Chris Green Phil 199831 Consed: A Graphical Tool for Sequence Finishing Genome Research 831088-905119520210.1101/gr.8.3.1959521923 · doi ↗ · pubmed ↗
- 6Jacobs-Sera Deborah Abad Lawrence A. Alvey Richard M. Anders Kirk R. Aull Haley G. Bhalla Suparna S. Blumer Lawrence S. Bollivar David W. Bonilla J. Alfred Butela Kristen A. Coomans Roy J. Cresawn Steven G. D'Elia Tom Diaz Arturo Divens Ashley M. Edgington Nicholas P. Frederick Gregory D. Gainey Maria D. Garlena Rebecca A. Grant Kenneth W. Gurney Susan M. R. Hendrickson Heather L. Hughes Lee E. Kenna Margaret A. Klyczek Karen K. Kotturi Hari Mavrich Travis N. Mc Kinney Angela L. Merkhofer Evan C. Moberg Parker Jordan Molloy Sa · doi ↗ · pubmed ↗
- 7Krogh A, Larsson B, von Heijne G, Sonnhammer EL. 2001. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305:567–580. https://doi.org/10.1006/jmbi.2000.431510.1006/jmbi.2000.431511152613 · doi ↗ · pubmed ↗
- 8Laslett D, Canback B. 2004. ARAGORN, a program to detect t RNA genes and tm RNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. https://doi.org/10.1093/nar/gkh 15210.1093/nar/gkh 152PMC 37326514704338 · doi ↗ · pubmed ↗
