Genome assembly of antimicrobial-resistant Escherichia coli HMVC1 isolated from healthy Mogosane village cattle, South Africa
Lerato Lisbeth Njaki Makhetha, Baitsholetsi Gloria Mokolopi, James Wabwire Oguttu, Christian Anayochukwu Mbajiorgu, Goitsemang Makete, Tshifhiwa Paris Mamphogoro

TL;DR
This paper reports the genome assembly of an antimicrobial-resistant Escherichia coli strain from healthy cattle in South Africa.
Contribution
The novel contribution is the genome assembly of HMVC1, an antimicrobial-resistant E. coli from healthy cattle in South Africa.
Findings
The genome of HMVC1 is 5,043,843 bp with a G + C content of 50.5%.
The strain contains several antimicrobial resistance genes including marA, mdtM, acrF, and acrD.
Abstract
Here, we present the genome assembly of E. coli strain HMVC1 isolated from rectal fecal samples of healthy cattle in South Africa. The genome size of HMVC1 consisted of 5,043,843 bp, with G + C content of 50.5%. The strain harbors marA, mdtM, acrF, acrD, and other antimicrobial resistance genes.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —University of South Africa (Unisa)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Escherichia coli research studies · Genomics and Phylogenetic Studies
ANNOUNCEMENT
Escherichia coli in farm animals has extensively been studied for decades in developed countries, and its data on whole genomic sequencing (WGS) are stored in the GenBank. Developing countries such as South Africa still have more work regarding depositing WGS of E. coli isolates from cattle in the GenBank for future studies (1). Besides the reputation of some strains of E. coli as animal pathogens, E. coli has been shown to account for the majority of resistance in Enterobacteriaceae; this bacterium has been postulated to serve as a reservoir of antimicrobial resistance genes within the digestive tract (2).
E. coli HMVC1 was isolated from rectal fecal samples of healthy cattle in Mogosane Village in the North West province, South Africa (25°45′30.6″S 25°33′43.9″E). A loopful of feces sample was inoculated into Buffered Peptone Water medium; the mixture was serially diluted up to 10^−2^ and inoculated onto McConkey agar for 24 h at 37°C. The pure culture was obtained by repeated streaking onto sterile Blood Agar (BA) (3). Genomic DNA was isolated from overnight culture on BA using HighPure PCR template preparation kit (Roche Diagnostics, Mannheim, Germany), in accordance with the manufacturer’s instructions. A NanoDrop (ThermoFisher Scientific, Carlsbad, CA, USA) was used to determine the concentration of extracted DNA, while the quality of the DNA was assessed using a 2% agarose gel. The DNA libraries were generated employing the Illumina TruSeq DNA Nano Preparation Kit (Illumina, San Diego, CA, USA); these libraries were sequenced by 150 bp paired-end sequencing on Illumina Hiseq X machine, producing a total of 4,845,267 paired-end 2 × 150 bp reads. The raw reads were trimmed using Trimmomatic v0.36 (4) and quality controlled using FastQC v0.11.5 (5). Trimmed reads were then de novo assembled using SPAdes v3.13.0 (6). Assembly quality was evaluated using QUAST v5.0.2 (7). While the genome completeness and contamination were assessed using CheckM v1.0.18 (8).
Identification of HMVC1 was conducted using Kaiju v1.7.3 (9), and the results were visualized using Krona v2.7.1 (10). Annotation was performed using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v6.3 (11). Default parameters were used for all software unless otherwise specified. The HMVC1 genome size consisted of 5,043,843 bp and is predicted to contain 4,960 total genes, of which 4,663 are predicted to encode proteins. Genome completeness was estimated at 100%, consisting of 110 contigs and the G + C content of 50.5%. The N50 value was 141,930 bp with a genome coverage of 354×. Antimicrobial-resistant genes were screened using resistant gene identifier (RG1) v5.1.1, which employs the comprehensive antibiotic resistance database v3.1.0 (12), allowing a broad homology-based search with defined criteria ranging from “Perfect” to “Strict” matches. A total of 38 strict hits along with 15 perfect were detected. E. coli HMVC1 harbor genes providing resistance to fluoroquinolones, cephamycins, tetracyclines, rifamycins, and penams along with that the resistance mechanisms mainly antibiotic efflux, reduced permeability, and antibiotic target alteration (Table 1) (13, 14). This study marks the ability of the isolated bacterial strains to be used in the production of novel antimicrobial compounds and in the surveillance of antimicrobial drug resistance.
TABLE 1: (A) Characteristics of the sequenced Escherichia coli HMVC1 and (B) antibiotic-resistant genes encoded by the strain
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Gambushe SM, Zishiri OT, El Zowalaty ME. 2022. Review of Escherichia coli O 157:H 7 prevalence, pathogenicity, heavy metal and antimicrobial resistance, African perspective. Infect Drug Resist 15:4645–4673. doi:10.2147/IDR.S 36526936039321 PMC 9420067 · doi ↗ · pubmed ↗
- 2Avalos M, Boetzer M, Pirovano W, Arenas NE, Douthwaite S, van Wezel GP. 2018. Complete genome sequence of Escherichia coli as 19, an antibiotic-sensitive variant of E. coli strain B REL 606. Genome Announc 6:e 00385-18. doi:10.1128/genome A.00385-1829724850 PMC 5940949 · doi ↗ · pubmed ↗
- 3Makhetha LLN, Ntushelo K, Mokolopi BG, Oguttu JW, Mbajiorgu CA, Makete G, Mamphogoro TP. 2023. Draft genome assemblies of 12 Enterobacter hormaechei strains isolated from the North West province of South Africa. Microbiol Resour Announc 12:1–5. doi:10.1128/MRA.00757-23PMC 1065293337906021 · doi ↗ · pubmed ↗
- 4Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics 30:2114–2120. doi:10.1093/bioinformatics/btu 17024695404 PMC 4103590 · doi ↗ · pubmed ↗
- 5Andrews S. 2010. Fast QC: a quality control tool for high throughput sequence data. Babraham Institute, Cambridge, United Kingdom. http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
- 6Nurk S, Bankevich A, Antipov D, Gurevich AA, Korobeynikov A, Lapidus A, Prjibelski AD, Pyshkin A, Sirotkin A, Sirotkin Y, Stepanauskas R, Clingenpeel SR, Woyke T, Mc Lean JS, Lasken R, Tesler G, Alekseyev MA, Pevzner PA. 2013. Assembling single-cell genomes and mini-metagenomes from chimeric MDA products. J Comput Biol 20:714–737. doi:10.1089/cmb.2013.008424093227 PMC 3791033 · doi ↗ · pubmed ↗
- 7Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi:10.1093/bioinformatics/btt 08623422339 PMC 3624806 · doi ↗ · pubmed ↗
- 8Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. Check M: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi:10.1101/gr.186072.11425977477 PMC 4484387 · doi ↗ · pubmed ↗
