Immunoinformatic Design and Evaluation of a Multi-Epitope mRNA Vaccine RP14914P Targeting Latent Tuberculosis Infection
Yuan Tian, Mingming Zhang, Syed Luqman Ali, Aigul Abduldayeva, Shuang Zhou, Yajing An, Yufeng Li, Ruizi Ni, Lingxia Zhang, Yanhua Liu, Weiguo Sun, Wenping Gong

TL;DR
This paper presents a new mRNA vaccine candidate, RP14914P, designed to prevent the reactivation of latent tuberculosis infection using computational methods.
Contribution
The novel contribution is the in silico design of a multi-epitope mRNA vaccine targeting LTBI with broad population coverage and predicted strong immune activation.
Findings
RP14914P encodes 14 cytotoxic T lymphocyte, 9 helper T lymphocyte, and 14 B-cell epitopes with high antigenicity and immunogenicity scores.
Molecular dynamics simulations confirm stable binding of the vaccine to TLR2 and TLR4 with docking energies of −1477.8 and −1480.1 kcal/mol, respectively.
Immune simulations predict strong Th1 polarization, high cytokine secretion, and durable memory responses.
Abstract
Background: Latent tuberculosis infection (LTBI) is the principal reservoir for active tuberculosis, with >85% of cases attributable to reactivation. Bacillus Calmette-Guérin fails to block this transition, leaving a critical gap in prevention. Methods: An immunoinformatics/reverse-vaccinology pipeline was applied to seven dormancy-related antigens retrieved from Mycobrowser. T-cell epitopes were predicted with NetMHCI/IIpan-4.1 and B-cell epitopes with ABCpred; antigenicity, allergenicity, and toxicity were evaluated with VaxiJen, AllerTOP, and ToxinPred. Secondary/tertiary structures were modeled with PSIPRED and AlphaFold-3; docking to Toll-like receptors (TLR) 2/4 and 100 ns molecular dynamics simulations assessed complex stability. Immune responses were simulated with C-ImmSim, and the mRNA sequence was human-codon-optimized using ExpOptimizer. Results: The resulting construct,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsvaccines and immunoinformatics approaches · Tuberculosis Research and Epidemiology · Monoclonal and Polyclonal Antibodies Research
1. Introduction
Tuberculosis (TB), a chronic infectious disease caused by agents of the Mycobacterium tuberculosis complex (MTBC), remains a major global health threat [1,2]. Its control is critically undermined by the high prevalence and reactivation risk of latent tuberculosis infection (LTBI) [3,4,5,6,7]. According to the WHO Global Tuberculosis Report 2025, an estimated 10.7 million incident cases of active tuberculosis (ATB) and 1.23 million deaths occurred worldwide in 2024, and China alone contributed ~696,000 cases and 25,000 deaths [8]. Approximately one-quarter of the world’s population—nearly two billion individuals—harbor MTBC in a latent state. These asymptomatic carriers are at a lifelong risk of reactivation when immunity wanes, and >85% of ATB cases are attributed to LTBI reactivation [9]. This “latency-to-activation” cycle constitutes the central obstacle to TB eradication, and current interventions leave a decisive technological gap.
The only licensed vaccine, Bacillus Calmette-Guérin (BCG), protects infants against disseminated TB but shows highly variable efficacy (0–80%) in adolescents and adults [3,10,11]. During the past two decades, candidate vaccines entering clinical evaluation (e.g., M72/AS01E, H56:IC31, and GamTBvac) have focused on preventing primary disease or boosting BCG-primed immunity; none is explicitly designed to interrupt LTBI reactivation. Moreover, most formulations rely on antigens expressed during active replication (e.g., Ag85B and ESAT-6) and therefore provide limited coverage of the diverse antigenic repertoire displayed across MTBC’s multi-stage life-cycle [12,13,14,15]. Closing the “LTBI-to-ATB” bottleneck thus represents a pivotal strategic unmet need.
The rapid advancement of immunoinformatics and mRNA technology offers a revolutionary opportunity to address this gap systematically [16,17,18,19,20,21,22]. Compared to traditional platforms, mRNA vaccines possess distinct advantages: (i) design flexibility allows for the integration of multiple protective epitopes from different stages into a single transcript, enabling comprehensive, multi-stage targeting of the MTBC life-cycle; (ii) endogenously expressed antigens facilitate simultaneous activation of both major histocompatibility complex (MHC)-I and MHC-II pathways, synergistically inducing CD8^+^ Cytotoxic T Lymphocyte (CTL) and CD4^+^ Helper T cell (Th1) responses, thereby overcoming the limitations of current vaccines in eliciting robust CTL activity; and (iii) the platform allows rapid adaptation to pathogen variations through “programmable” sequence iterations to optimize population coverage and immunogenicity [23,24]. Notably, BNT164a1 and BNT164b1 have entered Phase I clinical trials, marking a transition of TB mRNA vaccines from proof-of-concept to clinical translation [11]. However, an mRNA vaccine specifically targeting the LTBI stage, systematically integrating multiple epitopes, and possessing broad population coverage potential is still lacking.
Based on this research landscape and technological need, this study employed immunoinformatics and reverse vaccinology strategies to undertake the following work (Figure 1): First, dominant epitopes capable of concurrently eliciting CTL, Helper T Lymphocyte (HTL), and B-cell responses were screened from key latency-associated antigens (Rv1736c, Rv1980c, Rv2656c, Rv2659c, Rv3872, Rv3873, and Rv3879c) of Mycobacterium tuberculosis (MTB) H37Rv strain, with their safety validated through predictions of antigenicity, allergenicity, and toxicity. Second, Toll-like receptors (TLR) 2/4 agonists and the PADRE helper peptide were incorporated as built-in adjuvants, and a multi-epitope mRNA vaccine (designated RP14914P) was assembled using specific linkers (AAY, GPGPG, and KK). Finally, comprehensive in silico evaluations were conducted to assess the vaccine’s physicochemical properties, global population coverage, secondary and tertiary structure stability, molecular interactions with TLRs, and simulated immune response profiles. This study aims to design a broad-spectrum, safe, and efficacious multi-epitope mRNA vaccine targeting LTBI, providing a novel candidate molecule to prevent the progression from LTBI to ATB, while also offering a technical reference for the rational design of TB mRNA vaccines.
2. Materials and Methods
2.1. Antigen Selection and Sequence Retrieval
Based on an extensive literature review and our prior research experience, we selected seven antigens—Rv1736c, Rv1980c, Rv2656c, Rv2659c, Rv3872, Rv3873, and Rv3879c (Table 1) [25,26,27,28,29,30]—from a pool of 21 latency-associated antigens for subsequent analysis. The selection criteria were: (i) confirmed expression during MTB latency; (ii) potential immunogenicity; and (iii) being in preclinical or clinical research stages, indicating a foundational level of prior investigation. The FASTA format amino acid sequences for all candidate antigens were retrieved from the Mycobrowser database (https://mycobrowser.epfl.ch/, accessed on 11 February 2025).
2.2. Prediction and Screening of Immunodominant Epitopes
To construct a multi-epitope vaccine capable of eliciting a comprehensive immune response, we systematically predicted and screened CTL, HTL, and B-cell epitopes.
2.2.1. CTL Epitope Prediction
CTL epitopes were predicted using the MHC-I binding prediction tool within the IEDB database (https://nextgen-tools.iedb.org/pipeline?tool=tc1, accessed on 25 February 2025) [31]. The NetMHCpan-4.1 EL (recommended epitope predictor-2023.09) and NetMHCpan-4.1 BA (recommended binding predictor-2023.09) algorithms were employed to predict 9-10mer peptides against the complete human leukocyte antigen (HLA) reference allele set. Initial screening criteria were: EL rank < 0.5% and BA IC50 < 50 nM. Subsequently, a secondary filter was applied, requiring: antigenicity (VaxiJen v2.0) > 0.5, immunogenicity (IEDB Immunogenicity tool) > 0, non-toxicity (ToxinPred2.0), and non-allergenicity (AllerTOP v.2.1). Duplicate and similar sequences were removed.
2.2.2. HTL Epitope Prediction
HTL epitopes were predicted using the IEDB MHC-II binding prediction tool (https://nextgen-tools.iedb.org/pipeline?tool=tc2, accessed on accessed on 25 February 2025) [32] with the NetMHCIIpan-4.1 EL algorithm, predicting 12-18mer peptides against the full HLA reference allele list. Initial screening criteria were: EL rank < 0.5% and antigenicity > 0.5. The secondary filtering criteria were identical to those for CTL epitopes. Subsequently, the IFNepitope (https://webs.iiitd.edu.in/raghava/ifnepitope/design.php, accessed on 25 February 2025) [33], IL4pred (https://webs.iiitd.edu.in/raghava/il4pred/index.php, accessed on 25 February 202525) [34], and IL10pred (https://webs.iiitd.edu.in/raghava/il10pred/predict3.php, accessed on 25 February 2025) [35] online servers were used to evaluate the epitopes’ ability to induce specific cytokines. Epitopes predicted to induce IFN-γ-positive responses, while not inducing interleukin-4 (IL-4) and IL-10 secretion, were prioritized to bias the immune response towards a Th1 phenotype.
2.2.3. B-Cell Epitope Prediction
Linear B-cell epitopes were predicted using the ABCpred server (https://webs.iiitd.edu.in/raghava/abcpred, accessed on 27 February 2025) [36], which employs a recurrent neural network. The threshold was set to 0.51, and the two highest-scoring epitopes from each antigen were selected.
2.2.4. Epitope Homology Prediction
To gauge the vaccine’s potential for cross-strain protection, we mapped the evolutionary conservation of every CTL, HTL, and B-cell epitope it contains. Using BLASTP (https://blast.ncbi.nlm.nih.gov/Blast.cgi, accessed on 12 January 2026) on the NCBI portal, we restricted the search space to the genus Mycobacterium (taxonomy ID: 1763) and queried each peptide in turn, quantifying its percent identity to the major lineages of the MTBC, as well as to selected non-tuberculous mycobacteria (NTM).
2.2.5. Epitope Safety Validation
All final selected epitopes underwent safety validation using VaxiJen v2.0 (https://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html, accessed on 5 March 2025) [37] for antigenicity, AllerTOP v.2.1 (https://www.ddg-pharmfac.net/allertop_test/, accessed on 7 March 2025) [38] for allergenicity, and ToxinPred2.0 (https://webs.iiitd.edu.in/raghava/toxinpred2/algo.html, accessed on 7 March 2025) [39] for toxicity, ensuring compliance with safety standards for vaccine components.
2.3. mRNA Vaccine Construct Assembly
The selected epitopes were tandemly assembled using appropriate amino acid linkers. To optimize antigen processing and presentation and minimize the generation of junctional epitopes, the AAY, GPGPG, and KK linkers were used to connect CTL, HTL, and B-cell epitopes, respectively. To enhance vaccine immunogenicity, a TLR2 agonist, a TLR4 agonist, and the universal helper peptide (PADRE) were incorporated at the N-terminus via an EAAAK rigid linker. Finally, a 6×His tag was added to the C-terminus to facilitate subsequent protein purification and identification. For mRNA vaccine production, the final mRNA vaccine construct was designed from 5′ to 3′ as follows: 5′ m7G Cap—5′ UTR—Kozak sequence—TPA (signal peptide)—EAAAK (linker)—Psmα4 (adjuvant)—GPGPG (linker)—PADRE—GPGPG/AAY (linker)—CTL epitopes—GPGPG—HTL epitopes—KK (linker)—B-cell epitopes—EAAAK—Rpfe (adjuvant)—6×His—MITD sequence—3′ UTR—Poly(A) tail [23,24,40,41].
2.4. Analysis of Physicochemical Properties, Antigenicity, and Safety of the Candidate Vaccine
The theoretical molecular weight, isoelectric point (pI), instability index, aliphatic index, and grand average of hydropathicity (GRAVY) of the vaccine construct were predicted using the ExPASy ProtParam tool (https://web.expasy.org/protparam, accessed on 12 March 2025) [42]. Its solubility in E. coli was predicted using the Protein-sol server (https://protein-sol.manchester.ac.uk, accessed on 10 March 2025) [43], with a value >0.45 considered indicative of good solubility. Antigenicity and immunogenicity were predicted using VaxiJen v2.0 and the IEDB Immunogenicity tool, respectively. Allergenicity and toxicity were assessed using AllerTOP v.2.1 and ToxinPred2.0.
2.5. Selection of the Optimal Vaccine Construct
To identify a superior candidate, the screened CTL, HTL, and linear B-cell epitopes were randomly combined. The resulting candidate vaccines were evaluated based on biological characteristics, physicochemical properties, and solubility, leading to the identification of the optimal set of CTL, HTL, and B-cell epitopes. These were then randomly combined with different TLR agonists and the helper peptide. The optimal combination of agonists and helper peptide was determined, and this final construct was designated RP14914P.
2.6. Secondary and Tertiary Structure Prediction and Validation
The secondary structure of the vaccine was predicted using the PSIPRED (http://bioinf.cs.ucl.ac.uk/psipred, accessed on 2 April 2025) and SOPMA (https://npsa.lyon.inserm.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_sopma.html, accessed on 2 April 2025) servers [44,45,46]. The tertiary structure was predicted using AlphaFold-3 (https://golgi.sandbox.google.com, accessed on 5 April 2025), and the best model was selected from the five generated. The preferred tertiary structure model was subsequently refined using GalaxyRefine (https://galaxy.seoklab.org, accessed on 10 April 2025). The refined model was validated using ProSA-web (https://prosa.services.came.sbg.ac.at/prosa.php, accessed on 11 April 2025) (Z-score), ERRAT (https://saves.mbi.ucla.edu, accessed on 10 April 2025) (structure quality), and UCLA-DOE LAB-SAVES v6.1 (https://saves.mbi.ucla.edu, accessed on 17 April 2025) (Ramachandran plot) [47,48,49,50,51].
2.7. Immunoinformatics and Interaction Analysis
2.7.1. Population Coverage Analysis
The projected population coverage of the selected MHC class I and II epitopes across different geographical regions was calculated using the IEDB population coverage calculation tool (http://tools.iedb.org/population, accessed on 25 April 2025) [52].
2.7.2. Discontinuous B-Cell Epitope Prediction
Conformational B-cell epitopes were predicted using the ElliPro server (http://tools.iedb.org/ellipro, accessed on 14 April 2025) with a threshold score set to >0.735 [53].
2.7.3. Molecular Docking
The crystal structures of TLR2 (PDB ID: 2Z7X) and TLR4 (PDB ID: 3FXI) were retrieved from the PDB database. Rigid body molecular docking between the vaccine (RP14914P) and TLR2/TLR4 was performed using the ClusPro 2.0 online server (https://cluspro.bu.edu, accessed on 22 April 2025) [54,55]. The cluster with the lowest binding energy was selected from the generated models for visualization analysis using PyMOL (Version 3.1.0) [56] and interaction analysis using LigPlot+ (Version 2.2) [57].
2.7.4. Molecular Dynamics (MD) Simulation
Here, 100 ns MD simulations were performed for the RP14914P-TLR2 and RP14914P-TLR4 complexes using the Desmond module of the Schrödinger Suite [58]. The systems were equilibrated under the OPLS4 force field in an NPT ensemble at 300 K and 1 atm. Trajectory analysis included calculating the root mean square deviation (RMSD), root mean square fluctuation (RMSF), and radius of gyration (Rg). Principal component analysis (PCA) and free energy landscape (FEL) analysis were performed based on Cα atom trajectories using GROMACS. The dynamic cross-correlation matrix (DCCM) was calculated using the Bio3D R package (Version 3.1.0).
2.7.5. Immune Simulation
The immune response elicited by the vaccine was simulated using the C-ImmSim server (https://150.146.2.1/C-IMMSIM/index.php, accessed on 25 June 2025) [59]. The simulation volume was set to 50, with three injections (time steps: 1, 90, and 180), and a total of 1050 steps (each step representing 8 h); other parameters were kept at default.
2.8. Codon Optimization and mRNA Structure Prediction
To optimize RP14914P expression in Homo sapiens, codon optimization was performed using the NovoPro ExpOptimizer tool (https://www.novoprolabs.com/tools/codon-optimization, accessed on 28 June 2025) [60], aiming for a Codon Adaptation Index (CAI) close to 1.0 and a GC content within the ideal range of 30–70%. The secondary structure and minimum free energy (MFE) of the optimized mRNA sequence were predicted using the RNAfold tool from the ViennaRNA Package 2.0 (http://rna.tbi.univie.ac.at, accessed on 6 August 2025) [61].
3. Results
3.1. Rational Design and Assembly of the Multi-Epitope mRNA Vaccine RP14914P
Based on the construction strategy illustrated in Figure 1, to obtain the optimal antigen epitopes, a three-step screening strategy was implemented as follows:
For CTL epitopes, an initial screening was performed using the EL rank and IC_50_ values generated by the NetMHCpan-4.1 tool, which yielded 383 epitopes from an initial pool of 3963 candidates (Figure 2A). Subsequent fine screening was conducted based on the criteria of antigenicity, immunogenicity, non-toxicity, and non-allergenicity, resulting in 39 epitopes (Figure 2A). After removing redundant epitopes, a final set of 31 CTL epitopes was obtained (Table S1).
For HTL epitopes, 658 epitopes were first selected on the basis of satisfying binding affinity and antigenicity requirements (Figure 2B). Further screening was then carried out to identify epitopes capable of inducing IFN-γ secretion without triggering the production of IL-4 and IL-10. This process yielded 37 HTL epitopes with a bias toward Th1-type immune responses (Figure 2B), which were further reduced to 13 unique epitopes after eliminating redundancy (Table S2).
In addition, 60 potential epitopes were predicted using the ABCpred server, from which the top two highest-scoring linear B-cell epitopes were selected per antigen (Table S3).
As outlined above, 31 dominant CTL, 13 HTL, and 14 B-cell epitopes survived the multi-step filtering pipeline. Because epitope order can shape immunogenicity [62,63], we next generated 20 topological scaffolds in which these lead epitopes were shuffled at random (Table S4). Comparative profiling of their biological behavior, physicochemical fingerprints, and predicted solubility identified scaffold No.13—carrying 14 CTL, 9 HTL, and 14 B-cell epitopes—as the top performer (Table 2), and it was advanced as the vaccine backbone.
Built upon scaffold No.13, we installed four TLR2 and five TLR4 agonists and, by permuting their positions, obtained 26 candidate molecules (Table S5). Ranking these constructs on antigenicity, immunogenicity, toxicity, allergenicity, molecular weight, theoretical pI, instability index, aliphatic index, GRAVY, and solubility singled out the combination of the TLR2 agonist Psmα4 and the TLR4 agonist Rpfe. This pair was grafted onto scaffold No.13 to yield the final mRNA vaccine, designated RP14914P (Figure 3A,B).
3.2. Prediction and Optimization of Secondary and Tertiary Structures
Secondary-structure prediction indicates 27.25% α-helix, 12.70% extended strand, and 60.05% random coil, offering an optimal balance between rigidity and surface plasticity for immune recognition (Figure 3C). AlphaFold-3 generated its tertiary model (Figure 3D).
To ensure the reliability of the 3D structure prediction, we used the Galaxy Refine server to perform structural optimization (Figure 4A). The generated models were evaluated based on GDT-HA, RMSD, MolProbity, Clash score, and Poor rotamers. Model 5 (Table S6) was selected as the final 3D model of the vaccine. The z-scores before and after optimization were −5.97 (Figure 4B) and −6.21 (Figure 4C), respectively, indicating that the quality of the model was improved after optimization.
In addition, Ramachandran plot analysis revealed that prior to optimization, 83.1% of the amino acid residues fell within the favored regions, 11.4% within the additionally allowed regions, 3.6% within the generously allowed regions, and 1.9% within the disallowed regions (Figure 4D). After optimization, the proportion of residues in the favored regions increased to 97.5%, with 1.9% in the additionally allowed regions, 0.3% in the generously allowed regions, and 0.3% in the disallowed regions (Figure 4E). These results further confirm the effectiveness of the model optimization process.
3.3. Favorable Physicochemical Profile, Safety, and Global Population Coverage of RP14914P
In silico profiling revealed robust antigenicity (VaxiJen 0.7797) and immunogenicity (IEDB score 8.58629), with no predicted allergenicity or toxicity. The protein exhibits a theoretical pI of 9.61, an instability index of 28.65 (<40 threshold), an aliphatic index of 67.27, and a GRAVY of −0.424, indicating a hydrophilic and stable molecule. Protein-sol predicted an E. coli solubility score of 0.513 (>0.45 cut-off), and estimated half-lives are 30 h (mammalian reticulocytes), 20 h (yeast), and 10 h (E. coli).
Allele frequency-weighted coverage analysis demonstrated that included MHC-I epitopes span 82.35% of the global population (peak 85.97% in North America), whereas MHC-II epitopes cover 99.67% (peak 100% in North America) (Figure 5), underscoring the construct’s broad ethnic applicability.
3.4. Conservation Analysis of Vaccine Epitopes Across Mycobacterium Strains
To evaluate the potential of the RP14914P vaccine to confer broad-spectrum immunity, we systematically analyzed the sequence conservation of its 37 core epitopes across 87 Mycobacterium strains, including major branches of the MTBC and clinically relevant NTM. The results are visualized as a heatmap (Figure 6).
BLAST revealed that the core immunogenic epitopes of the vaccine exhibited excellent cross-strain stability: 19 epitopes (approximately 51% of the total) showed 100% sequence identity in 87 tested strains, corresponding to large continuous deep-red regions in the heatmap. This indicates that these epitopes are fully conserved across a broad range of pathogen lineages, including major MTBC branches and multiple NTM species (e.g., Mycobacterium avium complex), providing a structural basis for the vaccine’s broad-spectrum protection.
The remaining 16 epitopes maintained high sequence identity (81–100%): 14 of these epitopes retained ≥80% identity across all strains. Only two B-cell epitopes (MTERCLSISHRVRVPE and SSTPVGQLPPAATQTL) exhibited <90% sequence identity, corresponding to blue-to-purple regions in the heatmap, with the lowest value being 75%. This suggests mild variation in these two epitopes in some strains, while their overall homology remains high.
Collectively, the 37 epitopes of the RP14914P vaccine demonstrated high conservation across 87 MTBC and NTM strains, with only minor variations in a small subset of epitopes. These findings strongly support the potential of the vaccine to induce broad-spectrum immune protection.
3.5. Prediction of Discontinuous B-Cell Epitopes
ElliPro discontinuous epitope mapping (threshold 0.735) identified 20 conformational B-cell epitopes (scores 0.735–0.962) distributed across the molecular surface (Table S7), implying potent capacity to elicit high-affinity antibody responses.
3.6. High-Affinity Docking to TLR2/4 and Robust MD Trajectories
ClusPro docking returned exceptionally low binding energies of −1477.8 kcal mol^−1^ (TLR2) and −1480.1 kcal mol^−1^ (TLR4), driven by 23 and 38 hydrogen bonds, respectively (Figure 7). During 100 ns MD, both complexes equilibrated within 10 ns and displayed stable RMSD plateaus (18–23 Å). RMSF profiles revealed localized flexible loops, whereas decreasing Rg indicated progressive compaction (Figure 8). PCA and free-energy landscape analyses showed that the TLR2 complex occupies a single deep energy basin, whereas the TLR4 ensemble samples multiple shallow minima, implying greater conformational plasticity that may facilitate adaptive immune signaling (Figure 9).
3.7. In Silico Immune Simulation Predicts Potent Multi-Layer Immunity
C-ImmSim projection revealed rapid and sustained activation of innate and adaptive compartments. Natural killer (NK) cells peaked at day 90 and stabilized at 62–75 cells/µL. Three waves of MHC-2 macrophages coincided with the reduction in resting macrophages, whereas activated dendritic cells steadily accumulated, reflecting efficient antigen presentation (Figure 10).
Adaptive markers showed three synchronized helper T (TH)/cytotoxic T (TC) cells activation peaks (days 10, 40, 70) and a sharp rise in memory B cells within 5–10 days, forecasting durable humoral memory. Cytokine signatures were dominated by IFN-γ and IL-2 (Th1 axis) alongside regulatory IL-10 and transforming growth factor-beta (TGF-β), predicting robust anti-mycobacterial effector function balanced by protective homeostasis.
3.8. Codon Optimization and mRNA Secondary-Structure Stability
Human codon optimization (ExpOptimizer) yielded a 2598-nt sequence with CAI = 0.81 and GC = 61.09%—parameters optimal for both structural stability and efficient translation. RNAfold predicted a MFE of −1213.9 kcal mol^−1^ for the dominant secondary structure and −648.15 kcal mol^−1^ for the centroid structure (Figure 11), confirming a thermodynamically stable yet flexible conformation compatible with high-level expression and prolonged half-life in mammalian cells.
4. Discussion
LTBI represents a major challenge in TB control, with approximately 85% of active TB cases stemming from the reactivation of LTBI, a process that the existing BCG vaccine fails to effectively prevent [5,64,65,66,67,68,69,70,71]. Addressing this critical issue, this study employed immunoinformatics and reverse vaccinology strategies to design and evaluate the multi-epitope mRNA vaccine RP14914P. This candidate aims to block the progression from LTBI to active TB by targeting MTB latency antigens, integrating multiple types of immune epitopes, and incorporating a dual TLR adjuvant system. The following discussion, based strictly on the data from this study and the published literature, elaborates on the design innovativeness, core characteristics, molecular mechanisms, and study limitations.
The innovative design of RP14914P is first reflected in its antigen and epitope screening strategy. Early TB mRNA vaccines often utilized a single antigen, for instance, ESAT6 [72] and MPT83 [73]. While it induced short-term protection, it did not cover latency antigens and was thus ineffective against LTBI. The Hsp65-based mRNA vaccine in 2010 shared similar limitations [74]. Even the BNT164a1/b1 vaccines, which have entered Phase I clinical trials and employ a multi-antigen design, do not explicitly incorporate LTBI-specific antigens [75,76].
In contrast, the seven latency-associated antigens selected in this study all possess the characteristics of “high expression during latency and potential for clinical translation,” aligning with the design concept of our group’s previous PP19128R vaccine [77], but with a more focused antigen selection, avoiding unnecessary antigen interference. Building on this, RP14914P integrates 14 CTL, 9 HTL, and 14 B-cell epitopes, forming a complete response chain covering both cellular and humoral immunity. Compared to the vaccine designed by Al Tbeishat et al. based on epigenetic-related proteins (containing 17 CTL, 5 HTL, and 8 B-cell epitopes) [78], our vaccine includes a greater number of B-cell epitopes, which is conducive to inducing stronger humoral immunity. Compared to our group’s ZL12138L vaccine [79], RP14914P has a further increased number of CTL and B-cell epitopes, suggesting superior potential in cytotoxic responses and antibody production.
In terms of vaccine construction strategy, RP14914P adopts a modular design, using AAY, GPGPG, and KK linkers to tandem CTL, HTL, and B-cell epitopes, respectively. This approach effectively maintains the independence of epitopes and avoids immune interference, which is consistent with multiple recent studies [40,78]. Furthermore, this study integrated TLR2/4 dual agonists and the PADRE helper peptide at the N-terminus via an EAAAK linker, reducing steric clashes while enhancing the synergy of innate immune activation. Compared to vaccines using only a single TLR adjuvant, such as the Al Tbeishat vaccine using only the TLR4 agonist RpfE [78,80] or the Shahrear et al. vaccine relying solely on the TLR2 agonist C5 peptide [23], the dual TLR agonist design of RP14914P can more comprehensively mimic natural immune recognition patterns and activate broader immune signaling pathways. Molecular docking results showed that the binding energies of RP14914P to TLR2 and TLR4 were −1477.8 kcal/mol and −1480.1 kcal/mol, respectively, significantly lower than those of our group’s previous PP19128R vaccine (−1324.77 kcal/mol and −1278 kcal/mol) and ZL12138L vaccine (−1173.4 kcal/mol and −1360.5 kcal/mol) [79], suggesting stronger binding stability and laying a structural foundation for efficient innate immune activation.
RP14914P is predicted to demonstrate promising translational potential in terms of safety, physicochemical properties, and population coverage. Its antigenicity score is 0.7797, and its immunogenicity score is 8.58629. Although slightly lower than the Al Tbeishat vaccine (antigenicity 0.814) and the PP19128R vaccine (immunogenicity 9.29811), it still meets the criteria for strong immunogenicity and has been systematically verified as non-toxic and non-allergenic. Compared to the RUTI vaccine, which has entered Phase II clinical trials and reported local adverse reactions [81], RP14914P holds an advantage in safety. Regarding physicochemical properties, its instability index is 28.65, lower than that of the Sharma vaccine (28.23) and the Al Tbeishat vaccine (33.51), indicating superior in vitro stability. Its solubility of 0.513, although lower than that of PP19128R (0.900675), still meets the basic requirements for formulation development. Population coverage analysis revealed that RP14914P achieves global HLA class I and II coverage of 82.35% and 99.67%, respectively, superior to the Al Tbeishat vaccine (99.38%) and the PP19128R vaccine (82.24%/93.71%), demonstrating broad potential applicability.
From a structural biology perspective, the secondary structure of RP14914P consists of 27.25% alpha-helices and 60.05% random coils. This proportion ensures structural stability while allowing sufficient epitope exposure [82,83,84]. Compared to the vaccine designed by Shi et al. for central nervous system TB [40], RP14914P has a higher number of amino acids (866 vs. 383) and greater structural complexity, enabling the accommodation of more epitopes. Compared to the Sharma vaccine (alpha-helix content 31%), RP14914P has a higher proportion of random coils, which is more favorable for B-cell receptor recognition. After optimization, the Ramachandran plot of the tertiary structure showed 97.5% of residues in the most favored regions, outperforming the Sharma vaccine (88.22%) and the HP13138PB vaccine (88.22%) [85], indicating high reliability of its structural model. Molecular dynamics simulations further revealed that the RP14914P-TLR2 complex exhibits concentrated conformations and high stability, whereas the RP14914P-TLR4 complex demonstrates higher conformational flexibility. This characteristic of “coexistence of stability and dynamics” may enable it to effectively activate TLR signaling pathways in different immune microenvironments, achieving sustained and adaptive regulation of the innate immune response [86,87,88].
Immune simulation results suggested that RP14914P may elicit a multi-layered immune response: NK cells peaked at 90 days post-immunization and remained stable, and activated macrophages and dendritic cells increased continuously, demonstrating its good capacity for innate immune activation. In terms of adaptive immunity, the vaccine exhibited robust T and B lymphocyte activation capacity. The total number of helper T (TH) cells, memory TH cells, and activated TH cells increased continuously, with three response peaks observed following the three immunizations. This indicates that HTL epitopes are presented to CD4^+^ T cells via MHC class II molecules, driving the differentiation of naive T cells into TH cells and memory TH cells, and further inducing a TH1-type immune response, as evidenced by significant elevations in IFN-γ and IL-2 levels [89,90,91].
Meanwhile, the number of activated cytotoxic T cells increased continuously, reaching a peak on day 50, while the number of naive TC cells decreased correspondingly. This suggests that naive T cells are effectively differentiating into effector cells, likely triggered by the binding of CTL epitopes to CD8^+^ T cells [92,93,94].
The B lymphocyte response was also prominent: memory B cells increased sharply within 5–10 days post-immunization, and activated B cells showed the same trend, whereas naive B cells decreased accordingly. These results confirm that the vaccine can rapidly induce B cell activation, antibody production, and memory formation, laying a foundation for long-term immune protection [95,96].
These simulation results, compared to the repRNA-ID91-NLC vaccine reported by Larsen et al. [97], suggest more sustained immune activation features. Compared to the mRNA prime-protein boost strategy proposed by Rais et al. [98], the Th1-type cytokine profile simulated for RP14914P better aligns with the requirements for anti-tuberculosis immunity. Although these are only computational simulation results, they are consistent with the in vitro experimental trends of our group’s previous PP13138R [99] and PP19128R [77] vaccines, providing strong support for its immunogenic potential.
Despite the multiple advantages demonstrated in the design of RP14914P, this study has certain limitations. First, all results are based on bioinformatics predictions and require further validation through in vitro experiments, for instance, drawing on the ELISPOT method used for PP13138R or the cytokine detection scheme for PP19128R. N-methylpseudouridine modification will be incorporated during subsequent experimental synthesis to optimize the performance of the mRNA vaccine. Second, this study did not evaluate potential immune competition between epitopes or immunosuppressive effects mediated by regulatory T cells; this could be improved by referring to the analysis approach of T cell exhaustion markers used by Hu et al. [100]. Furthermore, the potential issue of overly strong immune reactions triggered by the dual TLR agonists may require further investigation. Finally, the optimization of the mRNA delivery system has not been addressed; strategies could explore the impact of delivery systems like lipid nanoparticles on vaccine efficacy, potentially drawing lessons from the antigen combination and delivery strategy of the B21 DNA vaccine by Weng et al. [101].
5. Conclusions
In this study, we designed and comprehensively evaluated RP14914P, a multi-epitope mRNA vaccine specifically targeting LTBI, using immunoinformatics and reverse vaccinology approaches. By integrating a three-step immunoinformatic screening pipeline, we selected 37 high-confidence T- and B-cell epitopes from seven well-characterized MTB latency-associated antigens. These epitopes were rationally assembled using immunologically optimized linkers and further reinforced with a built-in dual TLR2/4 agonist adjuvant system to enhance innate immune activation.
Computational structural analyses predicted that the vaccine exhibits a stable and soluble conformation, with 97.5% of residues in the most favored regions of the Ramachandran plot after refinement. Molecular docking and 100 ns MD simulations predicted strong and stable binding to TLR2 and TLR4, suggesting its potential to effectively engage pattern-recognition receptors. Importantly, conservation analysis revealed that the selected epitopes are highly conserved across 87 MTBC and NTM strains, with over 50% showing 100% sequence identity, indicating an indicator of potential broad-spectrum coverage.
In silico immune simulation projected robust and coordinated immune activation, characterized by Th1-polarized cytokine secretion (IFN-γ and IL-2), strong cytotoxic and helper T-cell responses, and durable B-cell memory. Together with a favorable safety profile, high predicted population coverage (≥82% for HLA-I and ≥99% for HLA-II), and optimized mRNA stability, RP14914P represents a rationally designed, computationally validated candidate that may specifically address the critical gap in preventing LTBI reactivation.
It is important to emphasize that all findings presented in this study are based on computational predictions and in silico analyses. While these results are promising and provide a strong rationale for further development, they require thorough experimental validation, including in vitro immunogenicity assays and in vivo challenge studies to confirm the vaccine’s safety, immunogenicity, and protective efficacy. This study provides a comprehensive immunoinformatic blueprint that can guide and accelerate subsequent experimental efforts toward developing a next-generation TB vaccine.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Liu Y. Yang L. Meskini M. Goel A. Opperman M. Shyamal S.S. Manaithiya A. Xiao M. Ni R. An Y. Gut microbiota and tuberculosis I Meta 20254 e 7005410.1002/imt 2.7005440860431 PMC 12371273 · doi ↗ · pubmed ↗
- 2Chen Z. Wang T. Du J. Sun L. Wang G. Ni R. An Y. Fan X. Li Y. Guo R. Decoding the WHO Global Tuberculosis Report 2024: A Critical Analysis of Global and Chinese Key Data Zoonoses 20255110.15212/ZOONOSES-2024-0061 · doi ↗
- 3An Y. Ni R. Zhuang L. Yang L. Ye Z. Li L. Parkkila S. Aspatwar A. Gong W. Tuberculosis vaccines and therapeutic drug: Challenges and future directions Mol. Biomed.20256410.1186/s 43556-024-00243-639841361 PMC 11754781 · doi ↗ · pubmed ↗
- 4Hatzenbuehler L.A. Starke J.R. Current Diagnosis and Treatment of Pediatric Latent Tuberculosis Infection Curr. Pediatr. Rep.2014214515510.1007/s 40124-014-0054-8 · doi ↗
- 5Doshi J. Deokar K. Gaikwad P. Novel tuberculosis skin tests for detecting latent tuberculosis infection Monaldi Arch. Chest Dis.202495315610.4081/monaldi.2024.315639498659 · doi ↗ · pubmed ↗
- 6Chee C.B.E. Reves R. Zhang Y. Belknap R. Latent tuberculosis infection: Opportunities and challenges Respirology 20182389390010.1111/resp.1334629901251 · doi ↗ · pubmed ↗
- 7Carranza C. Pedraza-Sanchez S. de Oyarzabal-Mendez E. Torres M. Diagnosis for Latent Tuberculosis Infection: New Alternatives Front. Immunol.202011200610.3389/fimmu.2020.0200633013856 PMC 7511583 · doi ↗ · pubmed ↗
- 8WHO Global Tuberculosis Report 2025 World Health Organization Geneva, Switzerland 2025176
