Draft genome assembly of Lacticaseibacillus paracasei MSR2 isolated from Bangladesh
Md. Shahedur Rahman, Ashikuzzaman Antor, Mahadi Hasan Sojol, Md. Monir Hossen, Mahadi Hasan, Rafi Hasnat Sarker, Mohammad Abu Hena Mostofa Jamal

TL;DR
This paper presents the draft genome of Lacticaseibacillus paracasei MSR2, a yogurt-isolated strain from Bangladesh.
Contribution
The novel contribution is the draft genome assembly of a newly isolated Lacticaseibacillus paracasei strain from Bangladesh.
Findings
The genome is 3,085,195 base pairs long and assembled into 132 contigs.
It encodes 2,903 predicted proteins.
Abstract
This study reports the draft genome sequence of Lacticaseibacillus paracasei MSR2, a strain isolated from yogurt in Bangladesh in 2024. The genome consists of a 3,085,195 base pair chromosome assembled into 132 contigs, encoding 2,903 predicted proteins.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
| Description of source | |
|---|---|
| Location | Kishoreganj, Bangladesh |
| Time | 2024 |
| Type | Yogurt |
| Sequencing summary | |
| Coverage | 111.8× |
| Total bases | 3,074,103 |
| GC content | 46.27% |
| Reads | 865,257 |
| Assembly report | |
| Contigs | 132 |
| GC content | 46.5% |
| Contig | 10 |
| Genome length | 3.1 Mb |
| Contig | 98.4 kb |
| Annotation report | |
| Genes | 3,079 |
| CDs | 2,903 |
| CDs (without protein) | 105 |
| ncRNAs | 3 |
| tRNA | 58 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbiotics and Fermented Foods · Genomics and Phylogenetic Studies · Oral microbiology and periodontitis research
ANNOUNCEMENTS
Lacticaseibacillus paracasei, a probiotic commonly present in dairy products, supports host health by offering numerous benefits (1, 2). L. paracasei has also been reported to exhibit antifungal and antibacterial activities (3). This study presents the draft genome assembly of L. paracasei MSR2.
A yogurt sample was collected from Kishoreganj, Bangladesh, in February 2024. The sample was spread onto De Man–Rogosa–Sharpe (MRS) agar (HiMedia, India) and incubated overnight at 37°C. A single colony from the pure culture was inoculated into MRS broth and incubated overnight at 37°C with shaking at 120 rpm. One milliliter of broth was used for genomic DNA extraction. DNA extraction was carried out using the Qiagen DNeasy Blood & Tissue Mini Kit (250) following the manufacturer’s protocol. After cell lysis, the sample was centrifuged at 14,000 rpm for 5 minutes to remove cellular debris. The DNA was further purified using the CDC PULSENET Total DNA Extraction protocol using the same kit. DNA quality was assessed using a Nanodrop spectrophotometer and a Qubit 4.0 Fluorometer. For Qubit measurements, a working solution was prepared by mixing Qubit Reagent and Qubit Buffer in a 1:200 ratio. Whole-genome sequencing was performed at the Genomic Centre, icddr’b, Mohakhali, Dhaka, Bangladesh. The DNA fragments were size selected using AMPure XP beads to isolate those within the desired range, typically 200–300 base pairs (bp) for standard Illumina sequencing libraries. The library for sequencing was prepared using the Illumina DNA Prep Reagent Kit and an automated liquid handler (epMotion 5075) for tagging the adapters with DNA fragments. After library preparation, sequencing was carried out using the Illumina MiSeq platform with paired-end 2 × 150 bp reads following the standard protocol provided by the manufacturer. Illumina’s bcl2fastq software version 2.20.0 (https://support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.html) was utilized with default parameters to demultiplex the raw reads and convert them into FASTQ format. For further downstream analysis, raw paired-end reads from the MiSeq platform were used.
FASTQC version 0.9.1 (www.bioinformatics.bbsrc.ac.uk/projects/fastqc) was employed to assess the quality of the raw data (4). Sickle version 1.3 tool was used to trim the paired-end sequence, where the trimming length was set to 20 (5). For assembly, Spades version 4.0.0 was applied with default parameters (6). The assembly was polished using Pilon version 1.24 (7). The species was identified using a whole genome-based taxonomic analysis in the Type (Strain) Genome Server (https://tygs.dsmz.de). Default parameters were used for all software unless otherwise specified.
The genome of L. paracasei MSR2 is 3,085,195 bp long with a GC content of 46.5%. The assembly contains 132 contigs with a coverage of 111.8×. A total of 3,019 genes were predicted, of which 2,852 are protein-coding genes. The details of genomic features are presented in Table 1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bengoa AA, Dardis C, Garrote GL, Abraham AG. 2021. Health-promoting properties of Lacticaseibacillus paracasei: a focus on kefir isolates and exopolysaccharide-producing strains. Foods 10:2239. doi:10.3390/foods 1010223934681288 PMC 8534925 · doi ↗ · pubmed ↗
- 2Kumaree KK, Prasanth MI, Sivamaruthi BS, Kesika P, Tencomnao T, Chaiyasut C, Prasansuklab A. 2023. Lactobacillus paracasei HII 01 enhances lifespan and promotes neuroprotection in Caenorhabditis elegans. Sci Rep 13:16707. doi:10.1038/s 41598-023-43846-937794096 PMC 10550917 · doi ↗ · pubmed ↗
- 3Chuang LC, Huang CS, Ou-Yang LW, Lin SY. 2011. Probiotic Lactobacillus paracasei effect on cariogenic bacterial flora. Clin Oral Investig 15:471–476. doi:10.1007/s 00784-010-0423-9PMC 313376820502929 · doi ↗ · pubmed ↗
- 4Andrews S. 2010. Fast QC: a quality control tool for high throughput sequence data. Babraham Bioinformatics.
- 5Joshi NF. 2011. Sickle: a sliding-window, adaptive, quality-based trimming tool for Fast Q files. https://github.com/najoshi/sickle.
- 6Prjibelski A, Antipov D, Meleshko D, Lapidus A, Korobeynikov A. 2020. Using SP Ades De Novo assembler. CP in Bioinformatics 70:e 102. doi:10.1002/cpbi.10232559359 · doi ↗ · pubmed ↗
- 7Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. P Lo S One 9:e 112963. doi:10.1371/journal.pone.011296325409509 PMC 4237348 · doi ↗ · pubmed ↗
