Characterization of Salmonella spp. Based on CRISPR PCR and FT-IR Approaches: A Pilot Study in Northern Italy
Clara Tramuta, Irene Floris, Monica Pitti, Giulia Federica Cazzaniga, Miriam Cordovana, Daniela Manila Bianchi

TL;DR
This study uses CRISPR PCR and FT-IR to quickly identify and differentiate Salmonella strains, showing promise for tracking their spread.
Contribution
The study introduces a combined CRISPR PCR and FT-IR-LDA method for rapid and accurate Salmonella serovar discrimination.
Findings
CRISPR PCR produced distinct amplicon sizes for each Salmonella serovar.
FT-IR spectra with LDA separated Salmonella into three distinct groups.
The combined approach showed potential for epidemiological surveillance of Salmonella.
Abstract
The aim of the present study was to employ a genotypic method (CRISPR PCR) and a phenotypic method (FT-IR combined with Linear Discriminant Analysis, LDA) for the rapid identification and discrimination of major virulent Salmonella serovars. Specifically, a total of 24 Salmonella Enteritidis, 24 Salmonella Typhimurium, 24 S. Typhimurium 4,5,12:i:-, and 14 Salmonella Infantis strains, previously serotyped according to the Kaufmann–White scheme, were analyzed. CRISPR PCR generated serotype-specific amplicons: 850 bp for S. Enteritidis, 700 bp and 2000 bp for S. Typhimurium, 1300 bp and 1500 bp for S. Typhimurium 4,5,12:i:-, and 1000 bp and 1900 bp for S. Infantis. FT-IR spectra of all serotypes were analyzed by LDA, which revealed a clear separation of three serotype-defined groups: S. Enteritidis, S. Infantis, and a group comprising S. Typhimurium and its monophasic variant 4,5,12:i:-.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9- —Italian Ministry of Health
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSalmonella and Campylobacter epidemiology · Vibrio bacteria research studies · Escherichia coli research studies
1. Introduction
Salmonella enterica is widespread in nature, with over 2600 different serovars, and can infect the intestines of animals and humans, causing salmonellosis [1]. Human salmonellosis is primarily contracted through the consumption of contaminated water, foods of animal origin, and raw or undercooked fruits and vegetables [2]. Transmission can also occur via contact with contaminated surfaces, tools, or food handled by infected individuals [3].
According to the annual report of the European Food Safety Authority (EFSA) and the European Centre for Disease Prevention and Control (ECDC) [4], Salmonella enterica is the second most common foodborne pathogen in humans in the European Union [4]. In 2023, 77,486 confirmed cases of human salmonellosis were reported, corresponding to a notification rate of 18.0 cases per 100,000 population. The most frequently isolated serovars in human infections were S. Enteritidis (70.8%), S. Typhimurium (8.9%), its monophasic variant S. 4,5,12:i:- (5.1%), and S. Infantis (2%) [4].
In poultry flocks, the National Salmonellosis Control Plan (NCCP 2022–2024) [5] identifies S. Enteritidis, S. Typhimurium, and the monophasic variant of S. Typhimurium as the serovars of greatest epidemiological relevance. Regulation (EU) 2073/2005 [6], as amended, lists these serovars as food safety criteria in fresh poultry meat.
Isolation and typing of Salmonella strains along the food chain are therefore essential; however, traditional methods, including the official serotyping method UNI EN ISO 6579-3:2017 [7], are time-consuming, costly, and require specialized personnel. Early identification of these serovars is critical for effective control. To reduce time and costs, the development of innovative, rapid, and easily applicable methods is therefore highly desirable.
The Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) molecular complex has the function of providing prokaryote cells with a form of acquired immunity, representing a kind of “immune system” capable of eliminating foreign DNA [8]. At the genetic level, the CRISPR complex consists of DNA segments containing short palindromic repeats, separated at regular intervals by small spacer DNA segments [9]. The characterization of CRISPR alleles provides information on the content of spacers to perform typing and subtyping of bacterial strains. In recent years, CRISPR sequences have been identified in about 40% of bacterial genomes, including Escherichia coli, Salmonella spp., and Staphylococcus aureus [10]. Salmonella contains two different CRISPR loci (CRISPR 1 and CRISPR 2) related to some of the main Salmonella serovars [11,12], such as S. Enteritidis and S. Typhimurium. Recently, molecular typing techniques have been successfully applied as an alternative or support to traditional methods, as they ensure more accurate characterization, faster execution and application to a wider variety of bacterial species. Size determination of the PCR product can be utilized as a quick screen within the context of an outbreak to rapidly compare and identify isolates with CRISPR arrays that are the same size as that of the outbreak strain [11]. While the fundamental mechanisms of CRISPR systems are well-documented, their application as practical typing tools in regional surveillance programs requires further exploration to evaluate their applicability and cost-effectiveness compared to gold-standard methods.
Fourier transform infrared spectroscopy (FT-IR) is based on the analysis of the biomolecular composition of the intact microbial cells and allows the rapid discrimination of microorganisms at different taxonomic levels [13,14,15]; it can be applied in many fields, including clinical [16,17,18,19,20] and food safety [21,22,23]. Compared with other typing methods, it is a rapid and cost-effective method [23,24].
In microbiology, IR spectroscopy has numerous applications [15] described in the literature: strain typing [11,14,23,25,26,27], discrimination of biofilm-producing strains [28,29], differentiation between living and dead cells [22], detection of stress-injured cells [30] and outbreak investigation [18,20,31].
In the field of food safety, FT-IR spectroscopy has been used to investigate both pathogenic [31] and probiotic bacterial strains [32,33,34,35,36].
FT-IR-based analysis is particularly suitable for Salmonella enterica, which has high antigenic diversity associated with clinical relevance [14]. For this reason, numerous scientific studies relating to this bacterial species are available in the bibliography [14,22,23,26,27,28,29,30,37]. FT-IR spectroscopy is often integrated with other advanced techniques, including genetic analysis, to provide more in-depth information on microbial functions [17,27].
The aim of this pilot study was to compare, for the first time, a genotyping approach based on CRISPR sequences with a phenotypic characterization method using FT-IR as diagnostic tools. These approaches, which are faster and simpler than traditional serotyping and more cost-effective than whole-genome sequencing, were evaluated for their ability to differentiate clinically relevant Salmonella strains not only at the serovar level but also at the individual strain level. In particular, Linear Discriminant Analysis (LDA) was applied to FT-IR spectra to assess whether infrared bacterial typing (IRBT) captures serovar- and CRISPR-associated biochemical signatures in Salmonella, providing a supervised multivariate approach for strain discrimination.
2. Materials and Methods
2.1. Salmonella Strains
Based on the serovars of greatest epidemiological interest, a total of 86 Salmonella strains selected from the collection of the Piedmont Regional Reference Centre for Salmonella Typing (Centro di Riferimento per la Tipizzazione delle Salmonelle, CeRTiS) were analyzed. Isolates, collected via laboratory passive surveillance in Piedmont and Liguria regions (Northwest Italy) in the period of 2007–2024, were identified by MALDI TOF/TOF mass spectrometry (Bruker Daltonics GmbH & Co. KG, Bremen, Germany) and successively serotyped according to the Kaufmann–White classification scheme [38]. The following serovars were selected for this study:
- (i)S. Enteritidis (n. 24) of animal, food and environmental origin (Table 1);
- (ii)S. Typhimurium (n. 24) of animal and food origin (Table 2);
- (iii)Monophasic variant of S. Typhimurium (S. 4,5,12:i:-) (n. 24) of human origin (Table 3);
- (iv)S. Infantis (n. 14) of human, food and animal origin (Table 4).
2.2. DNA Extraction and CRISPR PCR Approach
Genomic DNA from each isolate was extracted by using an Instagene matrix (Bio-Rad, Richmond, CA, USA). The CRISPR 2 primer pair (B1 and B2) used for monoplex PCR were taken from Fabre et al. (2012) [11]. The specificity of the method was checked in silico on CRISPR 2 sequences of representative Salmonella strains (e.g., S. enteritidis JF725428 and JF72543; S. Typhimurium JF725638; S. 4,5,12:i:- JF725292 and JF725259) using multiple-alignment software provided in the BioEdit package, version 7.2.6.1. PCR was then performed in a reaction volume of 25 μL using 1X DreamTaq Hot Start PCR Master Mix (Thermo Scientific, Waltham, MA, USA), 10 μM of each primer and 5 μL of DNA. Amplification was carried out in a thermal cycler CFX (Bio-Rad, Richmond, CA, USA) using an initial denaturation for 3 min at 95 °C, 35 cycles consisting of denaturation for 30 sec at 95 °C, annealing for 1 min at 59 °C, and extension for 1 min at 72 °C, followed by a final extension for 10 min at 72 °C. Electrophoresis of amplified products was carried out using 2% agarose gel in 1X TAE running buffer. The agarose gel was then stained with GelRed (Biotium, Fremont, CA, USA) and visualized by ultraviolet transillumination Gel-Doc (Bio-Rad, Richmond, CA, USA).
2.3. FT-IR Approach
The analyzed strains were incubated at 37 °C for 24 h on Blood Agar under standard atmospheric conditions. Sample preparation was performed using Biotyper IR Kits (Bruker Daltonics GmbH & Co. KG, Bremen, Germany) following the manufacturer’s instructions. Briefly, a bacterial suspension was prepared by collecting several colonies from the confluence zone of the culture plate and resuspending them in 50 μL of 70% ethanol. The suspension was vortexed, and 50 μL of deionized water was added. After mixing by pipetting, 15 μL of the suspension was placed in five technical replicates on a 96-spot silicon IRBT target plate.
The quality control standards, Infrared Test Standards (IRTS 1 and IRTS 2) provided with the kit, were resuspended in 90 μL of deionized water and 90 μL of absolute ethanol, then vortexed. We applied 12 μL of each suspension in duplicate to the plate. The plate was dried for 15–20 min at 35 °C.
Spectra were acquired in transmission mode in the spectral range of 500–4000 cm^−1^ (mid IR) using OPUS V7.5.18 (Bruker Optics, GmbH, Ettlingen, Germany) and IR Biotyper Client Software V3.0 (Bruker Daltonics GmbH, Bremen, Germany), applying the manufacturer’s recommended settings. In the IR Biotyper system, after acquisition, the spectra are smoothed using the Savitzky–Golay algorithm, the second derivative over 9 datapoints is calculated, and the spectra are then vector-normalized, to correct variations related to spectra acquisition and sample amount. No outlier spectra were actually found, and all the recorded spectra were included in the dataset.
Spectral analysis was performed using Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA), with serovar (S. Enteritidis, S. Typhimurium, S. 4,5,12:i:-, and S. Infantis) as the class label, to evaluate the discriminatory ability of IRBT spectra. The selected spectral regions (700–1300, 1400–1500, and 2800–3000 cm^−1^) were chosen based on preliminary exploratory analysis, which identified these intervals as those exhibiting the highest inter-group variability. These regions correspond to well-known biochemical vibrational bands, including nucleic acids, proteins, lipids, and CH stretching modes, which are commonly reported in FTIR-based discrimination studies. Subsequently, each serovar (S. Typhimurium, monophasic variant of S. Typhimurium, and S. Infantis) was analyzed separately by PCA/LDA using the same spectral regions. The number of LDA components was selected based on the deviation plot and by retaining those maximizing class separation while minimizing within-class variance, in accordance with standard discriminant analysis procedures. Within each serovar, strains were assigned to discrete classes based on their CRISPR amplicon patterns, which were used as the grouping variable. For S. enteritidis, spectral distances were calculated using the Euclidean distance metric, and hierarchical clustering was performed using the UPGMA algorithm (average linkage). Clusters were defined using a cut-off value of 0.203, automatically calculated by the software on the bases of Simpson’s diversity index and mean coherence.
3. Results
3.1. CRISPR PCR Approach
The CRISPR PCR used in this study was able to discriminate four clinically relevant serovars of Salmonella spp. The results showed that CRISP 2 monoplex PCR generated Salmonella serovar-specific amplicons of ~850 bp for S. Enteritidis (100% of strains), ~700 bp/2000 bp for S. Typhimurium (100% of strains), ~1300 bp/1500 bp for S. Typhimurium 4,5,12:i:- (100% of strains) and ~1000 bp/1900 bp for S. Infantis (85.7% of strains) (Table 5). Separation of the fragments by agarose gel electrophoresis was clear, easy to interpret and showed no interaction among Salmonella species. Amplificons obtained from representative Salmonella strains are shown in Figure 1, Figure 2, Figure 3 and Figure 4 and are comparable to those obtained previously [11,12].
3.2. FT-IR Approach
A total of 430 spectra (120 S. Enteritidis, 120 S. Typhimurium, 120 monophasic variants of S. Typhimurium, and 70 S. Infantis) were acquired.
FT-IR spectra from all serovars were analyzed by PCA/LDA, which resulted in a clear separation of three serovar-defined groups: S. Enteritidis, S. Infantis, and a group comprising S. Typhimurium and its monophasic variant (Figure 5).
When PCA/LDA was applied separately within each serovar using CRISPR amplicon profiles as predefined classes, additional discrimination among CRISPR genotypes was observed (Table 5). Within S. Typhimurium, two CRISPR-defined classes were discriminated (Figure 6), corresponding to strains carrying a 2000 bp amplicon and those carrying a 700 bp amplicon. Similarly, two CRISPR-defined classes were discriminated within the monophasic variant of S. Typhimurium (Figure 7), associated with 1500 bp and 1300 bp amplicons, respectively. Within S. Infantis, three CRISPR-defined groups were separated by LDA, corresponding to strains with a 1900 bp amplicon, those with a 1000 bp amplicon, and strains lacking detectable amplicons (Figure 8). For S. enteritidis, hierarchical clustering of FT-IR spectra identified five distinct clusters among the S. enteritidis isolates using a cut-off value of 0.203 (average Euclidean distance). The dendrogram showed a high cophenetic correlation coefficient (CCC = 0.865), indicating a good representation of the original spectral distances. No association was observed between the origin of the isolates and cluster formation (Figure 9).
4. Discussion
In Italy, Salmonellosis is the most common detected food-borne infection, with over 3000 reported cases annually [4]. Based on the distribution of serovars collected by CeRTiS, the most common serovars detected in 2024 in the Piedmont and Liguria regions were S. 4, [5],12:i:- (46.94%), S. Typhimurium (11.13%), and S. Enteritidis (8.90%) [39,40]. Prevention, control, and monitoring programs to reduce the presence of Salmonella strains in the food chain, together with a better knowledge of circulating Salmonella serovars and contamination sources, may contribute to defining food safety intervention priorities and implementing appropriate control measures [41].
Approaches alternative to serotyping are useful to rapidly identify and discriminate the most relevant Salmonella serovars. For these reasons, the aim of the present study was to increase the capacity for analysis and characterization of Salmonella strains using valid, rapid, inexpensive and not extremely laborious methods that can potentially be applied in molecular biology and microbiology laboratories in order to improve the surveillance of Salmonella infections. In general, the application of the methods used in this pilot study, on locally circulating strains of human, animal and food origin, makes it possible to define the genotypic and phenotypic profiles of bacterial strains to set up epidemiological investigations with the identification of public health-relevant profiles and to improve the ability to monitor the emergence of particular epidemic clones. The results showed a correspondence between CRISPR amplicon profiles and FT-IR biochemical signatures. However, it is important to note that this relationship remains an observed correlation within the current sample set rather than a demonstrated mechanistic link. While both methods appear to reflect serovar-specific characteristics, these findings could also be influenced by the clonal population structure of the isolates. Therefore, further genomic and biochemical studies are required to confirm whether specific CRISPR genotypes directly influence the phenotypic expression levels captured by FT-IR spectroscopy.
Salmonella contains two different CRISPR loci that correlate with some of the main Salmonella serovars, such as S. Enteritidis and S. Typhimurium, considered the most common human serovars as well as the most relevant for public and veterinary health. CRISPR polymorphism has been optimized and applied for the identification and characterization of clinically relevant Salmonella serovars in recent years [11,12,42,43,44]. The CRISPR-PCR method, based on the CRISPR 2 locus here applied, made possible in a single analytical run and using a single pair of primers, can discriminate Salmonella strains belonging to four different serovars, generating specific amplicons for S. Enteritidis, S. Typhimurium, the monophasic variant of S. Typhimurium (S. 4,5,12:i:-) and S. Infantis. The data obtained show that this approach can be used by official and microbiology laboratories both as a new method for identifying clinically relevant Salmonella serovars, and as a method of support for the classification of serovars according to the White–Kauffmann and Le Minor scheme. In addition, the method allows us to simplify the typing of Salmonella by multiplex PCR of fljB, gyrB and yefQ genes, used today as a standard method to discriminate S. Typhimurium from its monophasic variant 4,5,12:i:-. While the CRISPR-PCR assay demonstrates potential for rapid screening, comprehensive metrics for sensitivity and specificity remain to be established through broader validation studies.
Salmonella enterica exhibits high antigenic diversity, which is closely associated with its clinical relevance [26]. Consequently, numerous studies have applied FT-IR-based strain typing for the characterization of Salmonella isolates [14,22,23,30]. However, FT-IR spectra alone may not provide comprehensive insights into microbial genotypes and functions, and they are best integrated with advanced molecular approaches, such as CRISPR analysis, to achieve more detailed strain characterization [17,27].
In this study, LDA was applied to FT-IR spectra to assess the separation of predefined serovar- and CRISPR-defined classes. The LDA results suggest that differences in serovar and CRISPR profiles are accompanied by consistent changes in the biochemical composition of bacterial cells, as captured by IRBT.
Our preliminary results indicate that FT-IR, combined with LDA, can discriminate major serovars, including S. Enteritidis, S. Infantis, and S. Typhimurium. However, FT-IR could not fully separate S. Typhimurium from its monophasic variant 4,5,12:i:-, highlighting the need for complementary molecular approaches. In such cases, CRISPR-PCR remains an essential complementary tool for achieving full serovar discrimination. This study was designed as an exploratory proof-of-concept investigation with a limited sample size. Consequently, a formal independent training/testing split was not performed. This limitation should be considered when interpreting the generalizability of the proposed FT-IR model. Additional studies with larger sample sizes are required to further evaluate the discriminatory performance of FT-IR, particularly for these closely related serovars. Using a cut-off of 0.203, five spectral clusters were identified for S. Enteritidis among the analyzed S. Enteritidis isolates. While these clusters suggest the presence of five distinct groups, genomic data would be required to confirm their biological significance. In the case of S. Enteritidis, hierarchical clustering revealed multiple spectral clusters; however, no clear association with isolate origin was observed. We acknowledge that this indicates that FTIR-based clustering reflects biochemical variability rather than epidemiological or phylogenetic relationships.
Despite the limited dataset of 86 isolates, our findings suggest that this methodology could serve as a valuable screening tool. However, further studies with a larger and more geographically diverse panel are required to confirm its generalizability. This analytical approach could be used to support the official ISO method or as a screening tool for the rapid identification of these relevant serotypes. Moreover, when compared to current gold-standard techniques such as Whole-Genome Sequencing (WGS) or Matrix-Assisted Laser Desorption/Ionization Time-of-Flight (MALDI-TOF), the proposed CRISPR/FT-IR approach offers significant advantages in terms of speed and lower operational costs. However, it lacks the high-resolution genomic depth of WGS and requires rigorous local database calibration to overcome the inherent sensitivity of FT-IR to varying growth conditions.
In conclusion, the observed correlation between CRISPR amplicon profiles and FT-IR biochemical signatures suggests a potential link between genotypic and phenotypic traits in the studied isolates. However, this remains a preliminary observation, and further studies are needed to determine whether this reflects a direct mechanistic relationship or is a consequence of the clonal population structure.
5. Conclusions
The method of CRISPR PCR here employed was found to be economical and quick to perform, suggesting it could serve as a promising tool for routine laboratory screening, provided further large-scale validation is conducted. The ‘serovar specificity’ of the observed amplicons requires further confirmation through amplicon sequencing and testing against a broader, more diverse panel of Salmonella lineages. By maximizing the separation between predefined biological groups, LDA demonstrated that both serovars and CRISPR genotypes are associated with distinct infrared spectral fingerprints, indicating that genomic variation is reflected in measurable biochemical differences. We therefore consider that it is important to continue this type of investigation in order to evaluate a possible application of the method to other clinically relevant and circulating Salmonella serovars.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ferrari R.G. Rosario D.K.A. Cunha-Neto A. Mano S.B. Figueiredo E.E.S. Conte-Junior C.A. Worldwide Epidemiology of Salmonella Serovars in Animal-Based Foods: A Meta-Analysis Appl. Environ. Microbiol.201985 e 00591-1910.1128/AEM.00591-1931053586 PMC 6606869 · doi ↗ · pubmed ↗
- 2Bintsis T. Foodborne Pathogens AIMS Microbiol.2017352956310.3934/microbiol.2017.3.52931294175 PMC 6604998 · doi ↗ · pubmed ↗
- 3Zizza A. Fallucca A. Guido M. Restivo V. Roveta M. Trucchi C. Foodborne Infections and Salmonella: Current Primary Prevention Tools and Future Perspectives Vaccines 2024132910.3390/vaccines 1301002939852807 PMC 11768952 · doi ↗ · pubmed ↗
- 4European Food Safety Authority (EFSA) European Centre for Disease Prevention and Control (ECDC) The European Union One Health 2024 Zoonoses Report EFSA J.202523 e 975910.2903/j.efsa.2025.975941377167 PMC 12686834 · doi ↗ · pubmed ↗
- 5National Salmonellosis Control Plan in Poultry 2022–2024 Available online: https://www.izs.it/IZS/Benv-News-eng/National_Salmonellosis_Control_Plan_in_Poultry_2022-2024(accessed on 27 January 2026)
- 6European Union Commission Regulation (EC) No 2073/2005 of 15 November 2005 on Microbiological Criteria for Foodstuffs (Text with EEA Relevance)European Union Brussels, Belgium 2005 Volume 338
- 7ISO/TR 6579-3:2014 UNI Ente Italiano di Normazione International Organization for Standardization Geneva, Switzerland Available online: https://store.uni.com/iso-tr-6579-3-2014(accessed on 27 January 2026)
- 8Li J. Wu S. Zhang K. Sun X. Lin W. Wang C. Lin S. Clustered Regularly Interspaced Short Palindromic Repeat/CRISPR-Associated Protein and Its Utility All at Sea: Status, Challenges, and Prospects Microorganisms 20241211810.3390/microorganisms 1201011838257946 PMC 10820777 · doi ↗ · pubmed ↗
