Could vector-derived cadherin mimicry contribute to pemphigus vulgaris? An immunogenetic and in silico study involving HLA-DRB104:02 and 14:01
Bayram Toraman, Burak Kaan Kasap, Hande Ermis, Deniz Aksu Arica, Savaş Yayli

TL;DR
This study explores how genetic factors and vector-derived proteins might contribute to the development of pemphigus vulgaris, an autoimmune skin disease.
Contribution
The study introduces a novel Vector-derived Cadherin Mimicry framework linking genetic susceptibility and environmental exposure in pemphigus vulgaris.
Findings
HLA-DRB1*04:02 and HLA-DRB1*14:01 alleles are significantly enriched in pemphigus vulgaris patients.
Vector-derived cadherin-like peptides show structural compatibility with disease-associated HLA alleles.
The proposed VCM framework suggests environmental exposure to vector proteins may trigger disease in genetically predisposed individuals.
Abstract
Pemphigus vulgaris (PV) is a potentially life-threatening autoimmune blistering disease characterized by autoantibodies directed against the desmosomal cadherins desmoglein 3 (DSG3) and desmoglein 1 (DSG1), which are essential for keratinocyte adhesion. While the pathogenic role of these autoantibodies is well established, the upstream events leading to loss of immune tolerance against desmogleins remain incompletely understood. Genetic susceptibility conferred by HLA class II alleles and environmental exposures are thought to interact during disease initiation. In this study, we investigated HLA-DRB1 allele and genotype associations in Turkish PV patients and explored a hypothesis-generating framework linking genetic susceptibility with environmental exposure. Specifically, we examined whether vector-derived cadherin-like proteins could represent potential molecular mimics of…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5| HLA-DRB1* | PV (n) | % | HLA-DRB1* | Control (n) | % |
|---|---|---|---|---|---|
|
| 70 | 40.70 | 11:01 | 39 | 9 |
|
| 31 | 18.02 | 11:04 | 39 | 9 |
| 11:04 | 9 | 5 | 07:01 | 37 | 9 |
| 01:01 | 7 | 4 | 15:01 | 33 | 8 |
| 04:03 | 7 | 4 | 03:01 | 26 | 6 |
| 07:01 | 7 | 4 | 16:01 | 26 | 6 |
| 13:02 | 7 | 4 | 01:01 | 23 | 5 |
| 11:01 | 6 | 3 | 13:01 | 19 | 4 |
| 15:01 | 6 | 3 |
| 18 | 4.50 |
| 03:01 | 5 | 2 |
| 17 | 4.25 |
|
|
|
|
|
|
| HLA-DRB1* | Control (n) | PV (n) | p | adj_p | OR | 95% CI |
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14:04 | 0 | 4 | 0.00 | 0.06 | 20.19 | 1.46-∞ |
| 15:01 | 33 | 6 | 0.02 | 0.10 | 0.40 | 0.12-0.91 |
| 13:03 | 12 | 0 | 0.02 | 0.10 | 0.08 | 0-0.77 |
| 07:01 | 37 | 7 | 0.03 | 0.13 | 0.41 | 0.17-0.97 |
| 09:01 | 8 | 0 | 0.06 | 0.22 | 0.12 | 0-1.27 |
| 04:01 | 16 | 1 | 0.07 | 0.23 | 0.19 | 0.029-1.14 |
| 11:04 | 39 | 9 | 0.07 | 0.23 | 0.50 | 0.22-1.09 |
| 08:01 | 0 | 1 | 0.09 | 0.25 | 6.61 | 0.40-∞ |
| 11:11 | 0 | 1 | 0.09 | 0.25 | 6.61 | 0.40-∞ |
| 03:01 | 26 | 5 | 0.12 | 0.28 | 0.43 | 0.16-1.23 |
| 04:05 | 6 | 0 | 0.18 | 0.37 | 0.16 | 0-1.86 |
| 04:08 | 5 | 0 | 0.18 | 0.37 | 0.19 | 0-1.86 |
| 15:02 | 12 | 1 | 0.24 | 0.37 | 0.25 | 0.03-1.62 |
| 10:01 | 11 | 2 | 0.24 | 0.44 | 0.46 | 0.03-1.64 |
| 04:04 | 4 | 4 | 0.26 | 0.46 | 2.22 | 0.40-12.06 |
| 04:03 | 9 | 7 | 0.30 | 0.49 | 1.76 | 0.59-5.10 |
| 04:07 | 3 | 0 | 0.31 | 0.49 | 0.31 | 0-3.34 |
| 13:02 | 12 | 7 | 0.46 | 0.69 | 1.3 | 0.51-3.98 |
| 13:01 | 19 | 6 | 0.51 | 0.74 | 0.72 | 0.21-1.72 |
| 01:01 | 23 | 7 | 0.55 | 0.76 | 0.68 | 0.27-1.67 |
| 08:04 | 2 | 1 | 0.59 | 0.78 | 1.31 | 0.15-30.33 |
| 01:02 | 3 | 2 | 1 | 1 | 1.57 | 0.1-7.80 |
| 04:06 | 1 | 0 | 1 | 1 | 0.72 | 0-11.78 |
| 13:05 | 1 | 0 | 1 | 1 | 0.72 | 0-11.78 |
| 16:02 | 1 | 0 | 1 | 1 | 0.72 | 0-11.78 |
| 08:02 | 2 | 0 | 1 | 1 | 0.43 | 0-11.70 |
| 08:03 | 7 | 3 | 1 | 1 | 1.02 | 0.24-4.15 |
| 11:03 | 6 | 1 | 1 | 1 | 0.50 | 0.07-4.01 |
| 12:01 | 2 | 0 | 1 | 1 | 0.43 | 0-11.71 |
| DRB1 Genotype | Control (n) | PV (n) | a* | b* | c* | d* | p | adj_p | OR | %95 CI |
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 01:01/04:02 | 0 | 5 | 5.5 | 0.5 | 86.5 | 200.5 | 0.0027 | 0.0934 | 25.49 | 1.39 – 466.2 |
| 04:02/04:04 | 0 | 3 | 3.5 | 0.5 | 88.5 | 200.5 | 0.0298 | 0.6670 | 15.85 | 0.81 – 310.28 |
| 04:02/13:01 | 0 | 4 | 4.5 | 0.5 | 87.5 | 200.5 | 0.0091 | 0.2465 | 20.62 | 1.09 – 387.21 |
| 11:04/11:04 | 12 | 1 | 1.5 | 12.5 | 90.5 | 188.5 | 0.0700 | 1 | 0.24 | 0.04 – 1.38 |
| 15:01/15:01 | 7 | 0 | 0.5 | 7.5 | 91.5 | 193.5 | 0.1027 | 1 | 0.14 | 0.007 – 2.49 |
| 07:01/11:01 | 5 | 0 | 0.5 | 5.5 | 91.5 | 195.5 | 0.3294 | 1 | 0.19 | 0.01 – 3.55 |
| Component | Mean_ hDSG3 | Mean_ Aa_P | Difference (Aa_P - hDSG3) |
|---|---|---|---|
| VDWAALS | -66.79 | -109.68 | -42.89 |
| EEL | 38.06 | 36.93 | -1.13 |
| EGB | 3.54 | -0.03 | -3.57 |
| ESURF | -9.85 | -14.72 | -4.87 |
| GGAS | -28.73 | -72.74 | -44.01 |
| GSOLV | -6.31 | -14.75 | -8.44 |
|
|
|
|
|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutoimmune Bullous Skin Diseases · Skin and Cellular Biology Research · Dermatology and Skin Diseases
Introduction
1
Autoimmune diseases emerge when self-tolerance is lost due to interactions between genetic susceptibility and environmental triggers. Pemphigus vulgaris (PV) exemplifies organ-specific autoimmunity, characterized by pathogenic autoantibodies against desmoglein 3 (DSG3) and desmoglein 1 (DSG1), desmosomal cadherins vital for epithelial adhesion (1, 2). Binding of these antibodies disrupts keratinocyte junctions, causing acantholysis and intraepidermal blistering (3, 4). Although autoantibody pathogenicity is well defined, the mechanisms leading to loss of tolerance remain poorly understood (5). In PV, defects in both central and peripheral tolerance mechanisms are believed to contribute to disease onset. Central tolerance occurs in the thymus for T cells and in the bone marrow for B cells (6, 7). Defects in central and peripheral tolerance may contribute to PV onset. Incomplete deletion of autoreactive T cells in the thymus or B cells in the bone marrow may allow self-reactive clones to persist. Failures in peripheral tolerance mechanisms—such as anergy, regulatory T cell (Treg) suppression, or activation-induced cell death—can permit their activation (3–5, 7). Furthermore, epitope spreading, where immune responses expand from one epitope to others within or across proteins, contributes to disease progression. In PV, this is exemplified by the progression of immune responses from DSG3 to DSG1 epitopes, reflecting a clinical shift from mucosal-limited disease to the mucocutaneous form, which often correlates with increased disease severity and resistance to therapy (8, 9).
PV shows a consistent genetic link with HLA-DRB1*04:02 and *14:01 alleles, reported in multiple populations. These alleles likely facilitate presentation of DSG3 peptides to autoreactive CD4+ T cells (10–12). In addition to genetic predisposition, environmental factors are thought to play a contributory role in triggering or perpetuating the autoimmune response. In addition, environmental exposure may act as a secondary trigger. Molecular mimicry—a phenomenon in which microbial or environmental antigens share structural similarity with self-proteins—has been proposed as one mechanism driving autoimmunity (1, 3, 5, 8, 13). Endemic pemphigus foliaceus (fogo selvagem) in Brazil is one such example where sandfly (Lutzomyia longipalpis) salivary antigen (LJM11) has been implicated as a potential trigger due to its sequence resemblance to DSG1 (13, 14). Among blood-feeding vectors, exposure to Simulium nigrimanum (black fly) bites was shown to correlate with anti-desmoglein titers in an epidemiological study (15). However, no equivalent mimicry mechanism has been demonstrated for PV or DSG3-specific autoimmunity.
Here, we conducted a genetic association study to evaluate HLA-DRB1 allele frequencies in Turkish PV patients and matched controls. Our findings revealed a strong association between PV and the HLA-DRB104:02 and HLA-DRB114:01 alleles, consistent with previous reports in other populations. Building on these observations, we advance a hypothesis in which cadherin-like proteins from the salivary glands of mosquitoes or other hematophagous vectors (e.g., sandflies) may share structural features with DSG3/DSG1. In this proposed “vector-derived cadherin mimicry” (VCM) framework, structural similarity—rather than sequence identity—is considered a central element of potential mimicry. Within this context, repeated or chronic antigen exposure in genetically predisposed hosts could be compatible with scenarios leading to loss of tolerance and the emergence of PV-associated autoimmunity, without implying a deterministic or universal mechanism. To explore the structural and immunogenetic plausibility of this framework, we integrated HLA genotyping, peptide-binding predictions, structural modeling, and molecular dynamics analyses.
Materials and methods
2
Study population
2.1
This case–control study included patients diagnosed with pemphigus vulgaris (PV) at the Bullous Diseases outpatient clinic, Department of Dermatology and Venereology, Karadeniz Technical University (KTU). Diagnosis was confirmed by clinical findings, histopathology, direct immunofluorescence, and/or ELISA. Only patients without other autoimmune diseases who provided informed consent were included. In total, 86 PV patients (aged 21–81 years) and 200 age- and sex-matched healthy controls participated. Peripheral blood samples (5–9 ml) were collected in EDTA tubes for genomic DNA extraction using the MiniFavorPrep™ kit (Favorgen, Australia). The study was approved by the KTU Faculty of Medicine Ethics Committee (protocol no: 2022/51).
HLA-DRB1 genotyping and statistical analysis
2.2
HLA-DRB1 typing was performed by sequence-specific oligonucleotide probe (PCR-SSOP) using the Luminex platform (LABScan3D, One Lambda, USA) and analyzed with HLA Fusion software. The One Lambda SSP^®^ kit (Thermo Fisher Scientific) was used for allele identification (16, 17). Power analysis was conducted using the PGA v2.0 program to ensure 80% statistical power software (18). Hardy–Weinberg equilibrium in controls was tested with GENEPOP (19). Allele and genotype frequencies were compared using Pearson’s chi-square or Fisher’s exact test when expected counts were <5, implemented in RStudio (v2024.04.2). Significance was adjusted with the Benjamini–Hochberg FDR method; p_adj < 0.05 was considered significant.
Identification of cadherin-like homologs and antigenic peptide prediction
2.3
To identify candidate cadherin-like proteins with potential for molecular mimicry, we performed BLASTP searches against the VectorBase database (https://vectorbase.org/vectorbase/app) using the EC1 domain of human DSG3 as the query sequence. We employed PSI-BLAST to increase sensitivity for detecting structurally conserved and evolutionarily related sequences. PSI-BLAST utilize position-specific scoring matrices to capture deeper pattern-level homology not evident from linear sequence identity alone. The top-scoring result was a cadherin-like protein from Aedes albopictus, annotated based on homology to Aedes aegypti (protein: AAEL000597; Gene symbol: LOC5564848/neural-cadherin) (Supplementary File). This protein, with a predicted length of 1569 amino acids, was selected for further analysis based on its high similarity score and cadherin domain architecture. From the BLAST alignment, we extracted the homologous region to human DSG3 ectodomain 1 (hDSG3 EC1) and converted it to FASTA format. This sequence was subjected to MHC class II peptide-binding predictions using the IEDB MHC-II binding tool (www.iedb.org), applying the same parameters used for hDSG3. Based on our and other genetic association findings, HLA-DRB104:02 and HLA-DRB114:01 were selected as the MHC class II alleles of interest. Candidate antigenic peptides were prioritized by (1): strong predicted binding affinity (2), high immunogenicity scores, and (3) maximal alignment similarity with hDSG3 EC1 sequences (20, 21).
Structural modeling, molecular dynamics and binding free energy calculations
2.4
Structural modeling of EC1 peptides from hDSG3 and Aedes albopictus cadherin-like protein was performed using AlphaFold3 and AlphaFoldMultimer tools (22, 23) on the Colab platform (24). Molecular models were visualized using UCSF ChimeraX (v1.10) (25). All-atom molecular dynamics (MD) simulations were carried out using GROMACS (v2022.5) with the AMBER99SB-ILDN force field. Systems were solvated with TIP3P water, neutralized, energy-minimized, equilibrated, and subjected to a 100 ns production run at 298 K and 1 atm using the V-rescale thermostat and Parrinello–Rahman barostat (26). MD trajectories were analyzed to characterize structural stability and residue-level flexibility of the peptide–HLA complexes. Peptide backbone root mean square deviation (RMSD) was calculated following alignment of each trajectory frame to the HLA backbone atoms. Root mean square fluctuation (RMSF) was computed for peptide residues using the equilibrated portion of the trajectories. For all analyses, the final 47.5 ns of the 100 ns production simulations were considered. Detailed trajectory analysis procedures, including atom selection criteria and alignment settings, are provided in the Supplementary Methods.
Binding free energies (ΔG_binding) for peptide–protein complexes were estimated using gmx_MMPBSA (v1.5.7) with the MM/GBSA approach (GB-OBC II, igb=5). Representative frames from the equilibrated trajectories were used for binding energy estimation and per-residue energy decomposition analyses (27, 28). All additional parameters, scripts, and analysis settings are detailed in the Supplementary File.
Results
3
HLA-DRB1 alleles and genotype statistics
3.1
In the control group, HLA-DRB1 alleles were in Hardy–Weinberg equilibrium. A total of 33 distinct HLA-DRB1 alleles were detected across all participants (Table 1). The ten most frequent alleles in PV patients and controls are listed in Table 1. The most common alleles observed in PV patients were HLA-DRB104:02 (40%) and HLA-DRB114:01 (18%). In controls, the predominant alleles were HLA-DRB111:01, 11:04, and 07:01 (each approximately 9%). Allele frequencies and corresponding statistical analyses are summarized in Table 2. Carriage of HLA-DRB104:02 was significantly associated with PV in this cohort (OR = 12.95; 95% CI: 7.40–24.57). Similarly, HLA-DRB114:01 was more frequent among PV patients compared with controls (p = 1.23×10^-5^; OR = 4.55; 95% CI: 2.35–8.74). Genotype analysis was subsequently performed to evaluate the co-occurrence of both alleles (HLA-DRB104:02/14:01). Genotype frequencies and statistical results are shown in Table 3. The heterozygous HLA-DRB1*04:02/14:01 genotype was observed in PV patients but not detected among controls. This distribution reflects an enrichment of this genotype in the patient group within the studied cohort; however, estimates based on low genotype counts should be interpreted with caution.
hDSG3 EC1 and Aedes albopictus EC1-like domain antigenic peptide prediction
3.2
MHC class II peptide-binding predictions performed using the IEDB tool identified 19 unique predicted peptides within the hDSG3 EC1 domain and 167 within the A. albopictus EC1-like domain (Aa_P) (Supplementary Table S1). From these predicted peptides, candidates were prioritized based on a combination of predicted binding affinity, immunogenicity score, and sequence similarity to the hDSG3 EC1 domain (Figure 1).
Comparative analysis of the EC1 domains of hDSG3, hDSG1 and (A)albopictus cadherin-like protein reveals a conserved antigenic motif. (A) Schematic representation of the EC1 domain of hDSG3, hDSG1 and (A)albopictus, showing the most antigenic 16-mer peptides identified using the IEDB MHC-II binding prediction tool (highlighted in yellow). (B) Multiple sequence alignment of hDSG3, hDSG1 and EC1 domains with the (A)albopictus EC1-like domain. The highlighted regions correspond to the predicted peptides shown in panel (A) Alignment was performed using Clustal Omega, and conserved residues are indicated below the sequences. (C) Sequence logo comparing the selected peptides from Aa_P (MTGAIYVAGALDYETR) and hDSG3 (NTGDINITAIVDREET). The height of each letter reflects the relative frequency and conservation of the residue at that position. Coloring follows the ClustalX scheme, with residues colored based on their physicochemical properties: hydrophobic (green), small/hydroxyl (orange), acidic (red), basic (blue), polar (cyan), aromatic (magenta), and others (gray)—sequence logo generated using the Python-based Logomaker library (v0.8) in Google Colab.
Structural modeling and binding free energy analysis
3.3
Structural comparison of the modeled hDSG3 and A. albopictus EC1 domains revealed a high degree of similarity in overall domain architecture and secondary structure (Figures 2A-C; Supplementary File). Following docking of the predicted peptides to HLA-DRB1*04:02, stable receptor–ligand complexes were obtained for both peptide–HLA pairs (Figure 3). Subsequent molecular dynamics simulations indicated comparable binding poses for the two peptides throughout the analyzed trajectories. Despite these structural similarities, calculated binding free energies differed between the two complexes (Table 4).
Structural comparison and alignment of EC1 cadherin domains from hDSG3 and Aedes albopictus. (A) Ribbon representation of the extracellular cadherin 1 (EC1) domain from human desmoglein-3 (hDSG3_EC1_CAD), predicted using AlphaFold. (B) Ribbon representation of the EC1-like domain from Aedes albopictus cadherin-like protein (Aa_P_CAD), based on AlphaFold modeling and sequence homology to the human DSG3 EC1 domain. (C) Structural overlay of the two domains, highlighting conserved β-strand architecture and overall topological similarity. Overlay was generated by pairwise structural alignment using UCSF ChimeraX (v1.10), employing default Needleman–Wunsch alignment and backbone RMSD minimization. Color scheme: hDSG3 (green), Aa_P (cyan). The high degree of β-sheet preservation supports the hypothesis that vector-derived cadherin-like domains may mimic key conformational features of desmogleins, potentially enabling molecular mimicry in the context of HLA-mediated antigen presentation.
Surface representation of the HLA-DR heterodimers (A) Molecular surface rendering of the HLA-DR heterodimer complex comprising the α-chain (HLA-DRA, blue) and β-chain (HLA-DRB104:02, green) bound to a representative peptide derived from the EC1 domain of human desmoglein 3 (hDSG3). The peptide is shown as a stick model within the peptide-binding groove. (B) Equivalent surface rendering of the same HLA-DR heterodimer bound to a predicted immunogenic peptide from the EC1-like domain of a cadherin-like protein in Aedes albopictus (Aa_P peptide). The vector-derived peptide adopts a similar binding pose within the groove, demonstrating structural compatibility. Models were generated using AlphaFold-Multimer and visualized with UCSF ChimeraX (v1.10). Structural alignments and docking were manually refined to preserve canonical MHC class II geometry. Conserved binding orientation and similar residue positioning between peptides support the hypothesis of structural mimicry between vector-derived and self-antigens, relevant to HLA-DRB104:02–restricted immune responses.
Molecular dynamics stability and flexibility analyses
3.4
The dynamic stability of the peptide–HLA complexes was evaluated by peptide backbone root mean square deviation (RMSD) analysis following alignment to the HLA backbone (Figure 4A). RMSD traces calculated over the equilibrated portion of the trajectories (last 47.5 ns) indicated that both the hDSG3 EC1-derived peptide and the Aa_P peptide fluctuated within comparable RMSD ranges, without evidence of sustained structural drift. Occasional transient RMSD excursions were observed for the hDSG3 EC1-derived peptide during the analyzed window; however, these fluctuations were short-lived and did not result in a sustained increase in RMSD, supporting comparable backbone stability of both peptides under the simulated conditions. Residue-level flexibility of the bound peptides was assessed by root mean square fluctuation (RMSF) analysis after alignment to the HLA backbone (Figure 4B). Both the Aa_P and hDSG3 EC1-derived peptides exhibited low RMSF values across the central region of the peptide sequence, indicating constrained fluctuations within the HLA binding groove. Increased flexibility was primarily observed at the N- and C-terminal residues for both peptides, a pattern commonly associated with reduced anchoring at peptide termini. Overall, the RMSF profiles of the two peptides were broadly similar, with only modest residue-specific differences. No pronounced or regionally distinct increases in flexibility were observed for either peptide, suggesting comparable dynamic accommodation within the HLA binding site under the simulated conditions.
Molecular dynamics stability and flexibility analyses of peptide–HLA complexes. (A) Peptide backbone RMSD profiles for the hDSG3 EC1-derived peptide and the Aa_P peptide, calculated over the equilibrated phase of the MD simulations (last 47.5 ns of 100 ns), following alignment to the HLA backbone. Both peptides exhibit comparable RMSD ranges without evidence of sustained structural drift. (B) Peptide RMSF profiles calculated over the equilibrated portion of the trajectories. Both peptides show low to moderate residue-level fluctuations within the HLA binding groove, with increased flexibility primarily confined to terminal residues.
In silico evaluation of the VCM hypothesis
3.5
To assess the plausibility of the VCM hypothesis, we constructed HLA-DRA/DRB104:02–peptide complexes and performed peptide-binding predictions, structural modeling, and molecular dynamics simulations. Binding free energy (ΔG_binding) calculations indicated that the Aa_P peptide derived from A. albopictus exhibited a more favorable binding energy than the hDSG3 EC1-derived peptide (Table 4), suggesting a high affinity for the risk-associated HLA-DRB104:02 molecule. Structural similarity and comparable binding modes support the potential for molecular mimicry (Figures 2, 3). These computational results provide preliminary support for the VCM hypothesis and guided the development of a mechanistic model (Figure 5).
Proposed model of Vector-derived Cadherin Mimicry (VCM) in pemphigus vulgaris. This schematic illustrates a hypothesis-generating, non-deterministic model describing how exposure to hematophagous vector–derived cadherin-like proteins could contribute to PV in genetically susceptible individuals. It is intended to illustrate a conceptual framework supported by in silico analyses and existing transcriptomic observations, highlighting testable hypotheses for future experimental investigation. 1. Aedes albopictus introduces salivary components into the host skin during blood feeding. 2. Vector saliva, together with tissue injury at the bite site, leads to deposition of salivary molecules and potential cadherin-like peptide candidates within the dermis. 3. Local tissue perturbation and cellular stress may result in the release of damage-associated molecular patterns (DAMPs) and, in some contexts, pathogen-associated molecular patterns (PAMPs) derived from microbial co-inoculation. 4. Resident antigen-presenting cells (APCs), including dendritic cells, encounter vector-derived peptides in an inflammatory microenvironment shaped by tissue damage and innate immune cues. 5. Innate immune sensing occurs through pattern-recognition receptors (e.g., Toll-like receptors), responding to DAMPs/PAMPs, promoting APC activation and cytokine production. 6. Vector-derived cadherin-like peptides (Aa_P) are internalized by APCs and processed via the endosomal pathway. 7. Processed peptides are presented as peptide–HLA class II complexes, including complexes involving PV-associated alleles such as HLA-DRB104:02 and HLA-DRB114:01, and recognized by CD4+ T cells through the T-cell receptor (TCR). 8. Activated CD4+ T cells provide cytokines and co-stimulatory signals that support B-cell activation. 9. B cells recognizing vector-derived cadherin-like peptides undergo clonal expansion and differentiation, including germinal-center processes such as somatic hypermutation and class-switch recombination. 10. Plasma cells initially produce antibodies directed against vector-derived antigens. 11. Through molecular mimicry and affinity maturation, a subset of antibodies may acquire cross-reactivity toward desmoglein 3 (DSG3) and/or desmoglein 1 (DSG1). 12. Anti-desmoglein autoantibodies bind to desmosomal cadherins on keratinocyte surfaces in the epidermis. 13. Autoantibody binding interferes with desmoglein-mediated cell–cell adhesion, contributing to acantholysis and intraepidermal blister formation characteristic of PV.
Discussion
4
Pemphigus vulgaris (PV) is a potentially life-threatening autoimmune blistering disease, and the mechanisms underlying the development of autoantibody responses against desmogleins remain incompletely understood. The disease is widely considered to arise from the interplay between genetic susceptibility and environmental factors, with human leukocyte antigen (HLA) class II genes representing the strongest and most consistently replicated genetic contributors. Multiple independent cohort studies have implicated specific HLA-DRB1 alleles, particularly HLA-DRB104:02 and HLA-DRB114:01, as being associated with increased susceptibility to PV (29, 30). The prevalence of HLA-DRB104:02 has been reported to range from approximately 33% to over 90% across different PV populations, while HLA-DRB114:01 has been observed in 28–44% of patients in several cohorts (30).
In the present study, HLA-DRB104:02 and HLA-DRB114:01 were detected in 40.7% and 18% of PV patients, respectively, findings that are broadly consistent with previously reported ranges. The observed frequency of HLA-DRB114:01 in our cohort was lower than that reported in several other populations (Table 1). In contrast, HLA-DRB116:01 and HLA-DRB111:01 were less frequent among PV patients than in controls, suggesting a possible inverse association with disease susceptibility in this cohort (Table 2). Previous studies investigating HLA associations in PV have predominantly relied on allele- or haplotype-based analyses (31–36). In the present cohort, genotype-level analysis revealed that the heterozygous HLA-DRB104:02/14:01 genotype was observed exclusively among PV patients and not detected in controls. This distribution resulted in a high odds ratio estimate (OR: 114.97; 95% CI: 6.86–1925.82); however, the wide confidence interval reflects the low frequency of this genotype and indicates substantial statistical uncertainty (Table 3). Unlike typical genetic association studies that focus on common bi-allelic single-nucleotide polymorphisms (SNPs) (37), HLA-DRB1 is highly polymorphic, comprising hundreds of distinct alleles. This extensive allelic diversity gives rise to numerous rare genotype combinations, which substantially limits statistical power in small- to moderate-sized cohorts. For this reason, most previous studies have emphasized allele- or haplotype-level associations rather than genotype-level patterns. While our findings suggest a notable enrichment of a specific heterozygous genotype within this cohort, confirmation of genotype-level associations will require evaluation in larger and independent populations with sufficient genotype counts.
Collectively, these observations reinforce the central role of HLA-DRB1 variation in PV susceptibility while underscoring that genetic predisposition alone is unlikely to be sufficient for disease development. A range of environmental and microbial factors—including diet, medications, and infections—has been proposed as potential contributors to disease initiation (5, 38–40), yet none has been conclusively shown to account for the initial breach of immune tolerance. In the context of endemic pemphigus foliaceus (Fogo selvagem), exposure to salivary antigens from hematophagous insects, such as sandflies and blackflies, has been proposed as a possible environmental contributor to the development of anti-DSG1 autoantibodies (13, 14, 39). Building on this conceptual framework, and to explore the plausibility of the proposed VCM hypothesis, we applied an integrative in silico approach combining HLA genotyping, peptide-binding prediction (Figure 1, Supplementary File), structural modeling (Figures 2, 3), and molecular dynamics simulations (Table 4). These analyses were designed to assess whether cadherin-like proteins derived from hematophagous vectors could exhibit structural and immunological similarity to hDSG3 that may be relevant in individuals carrying PV-associated HLA-DRB1 alleles. We propose that repeated exposure to hematophagous vectors may represent a plausible environmental context in which cadherin-like proteins originating from salivary gland epithelial cells become immunologically relevant to the host in genetically predisposed individuals (Figure 5). The salivary glands of A. albopictus consist of a single layer of secretory epithelial cells, typically cuboidal or columnar, organized into tubular acini and ducts (41). As in other epithelial tissues, these cells express cadherins at their apicolateral membranes to maintain epithelial integrity and polarity (42, 43). During blood feeding, mechanical stress, physiological cell turnover, or cellular damage associated with apoptosis, necrosis, or infection could allow membrane-associated cadherin-like fragments to become accessible within the salivary milieu. While the presence of cadherin transcripts (Supplementary Table S3) and epithelial integrity mechanisms in mosquito salivary glands is well supported, direct evidence for the abundance, stability, and immunogenicity of cadherin-derived proteins within mosquito saliva remains limited. Accordingly, this proposed pathway is presented as a biologically plausible, yet hypothesis-generating model rather than a demonstrated pathogenic mechanism.
Conceptual framework: from vector-associated immune priming to desmoglein-targeted autoimmunity
4.1
In genetically susceptible individuals (e.g., carriers of HLA-DRB1*04:02 or *14:01), repeated exposure to cadherin-like peptide candidates potentially present in the saliva of Aedes albopictus or other hematophagous insects (e.g., sandflies) may be associated with a subclinical or priming immune response against these foreign antigens. One plausible scenario involves immune priming during vector bites, in which tissue injury or microbial co-inoculation generates damage-associated molecular patterns (DAMPs) or pathogen-associated molecular patterns (PAMPs). These signals may promote dendritic cell activation and favor the differentiation of naïve T cells toward immunogenic rather than tolerogenic phenotypes. In the presence of sequence and structural similarity to hDSG3 and hDSG1, certain B- and T-cell clones may therefore exhibit features consistent with molecular mimicry and potential cross-reactivity. Over time, somatic hypermutation, affinity maturation, and class-switch recombination within germinal centers (44) may enhance the specificity and binding strength of these B-cell receptors, potentially increasing the likelihood of recognition of self-antigens (DSG3/DSG1) (45). Initial tissue damage may further expose cryptic desmoglein epitopes, which are processed and presented by antigen-presenting cells, thereby promoting epitope spreading. This process could drive clonal expansion of autoreactive B cells and facilitate a transition from vector-derived immune recognition to sustained autoimmunity. Ultimately, persistent autoantibody production against DSG3 and DSG1 could contribute to disruption of keratinocyte adhesion, consistent with mechanisms underlying blister formation in pemphigus vulgaris (Figure 5).
To test our VCM hypothesis, at least in silico, we employed a combination of generating HLA-DRA/DRB104:02 complexes, peptide-binding predictions, structural modeling, and molecular dynamics analyses (Table 4). Our analysis revealed that the Aa_P peptide exhibited lower estimated binding free energy than the hDSG3 EC1-derived peptide in our in silico analyses. These data provide preliminary support for the VCM hypothesis, although experimental validation is required. Additional contextual support for the plausibility of the hypothesis comes from transcriptomic data reported by Sim and colleagues, indicating regulation of cadherin-related transcripts in the salivary glands of Aedes aegypti (46) (Supplementary Table S3). For experimental validation, an appropriate animal model is essential. Humanized mice expressing the HLA-DRB104:02 allele can be utilized, as previously demonstrated by Eming et al. in a preclinical mouse model of PV (47). The VCM hypothesis could be conceptually evaluated in future studies using these mice with both recombinant hDSG3 ectodomains and A. albopictus–derived cadherin-like (CAD-like) peptides (Figures 1A, 2B).
Vector-associated mimicry in pemphigus: lessons from endemic pemphigus foliaceus and implications for pemphigus vulgaris
4.2
In regions such as Brazil and Tunisia, endemic pemphigus foliaceus (EPF)—including its localized form Fogo Selvagem (FS)—has long been suspected to arise from environmental exposures, particularly to insect vectors. Multiple studies have identified associations between anti-DSG1 autoantibodies and exposure to vector-borne antigens. For example, in Brazil, Lutzomyia longipalpis (the sandfly vector of visceral leishmaniasis) was shown to elicit anti-Dsg1 IgG4 antibodies through salivary proteins such as LJM11 and LJM17, which cross-react with epitopes on human Dsg1. In murine models, immunization with LJM11 induced anti-Dsg1 antibodies, lending support to the hypothesis of vector-induced molecular mimicry in genetically susceptible individuals (e.g., HLA-DRB104:04 carriers) (12–15). Similarly, in Tunisia, anti-Dsg1 antibodies have been detected in sera of patients with leishmaniasis and hydatidosis (48). Salivary proteins such as PpSP32 from Phlebotomus papatasi were also shown to bind both Dsg1 and Dsg3 in vitro. Interestingly, although vector antigens triggered desmoglein-recognizing antibodies in mice, definitive cross-reactivity and pathogenic potential remain unconfirmed. These observations suggest a possible mimicry mechanism but fall short of providing direct evidence for pathogenic antibody induction and acantholysis (13, 14, 48, 49). While these studies contribute valuable epidemiological and immunological insights, they do not fully explain the molecular mechanisms of epitope recognition or autoimmune activation. Notably, sandfly-derived proteins studied thus far have not been conclusively shown to generate immunodominant or acantholytic responses, and structural mimicry has not been conclusively demonstrated through high-resolution computational or crystallographic methods (48, 50). Against this backdrop, our current study advances the field by focusing not on salivary proteins per se, but rather on cadherin-like proteins encoded by the vectors themselves, particularly those with structural homology to the EC1 domain of human Dsg3—the primary autoantigen in PV. This novel approach addresses a major gap in the literature by proposing that vector-derived cadherin homologs may act as molecular mimics of hDsg3. Unlike earlier research, our study integrates peptide–HLA binding predictions, homology modeling, and molecular dynamics simulations to explore the plausibility of cross-reactive interactions. By centering on HLA-DRB104:02, a well-established susceptibility allele for PV, we propose a mechanistically and structurally coherent framework for how vector-derived proteins may break immune tolerance and initiate autoimmunity in genetically predisposed individuals.
Limitations
4.3
Several limitations of the present study should be acknowledged. First, although statistically significant associations were observed for HLA-DRB104:02 and HLA-DRB114:01 alleles, genotype-level analyses—particularly for the heterozygous HLA-DRB1*04:02/14:01 genotype—were based on low-frequency observations. As a consequence, estimates of effect size for rare genotypes are subject to statistical uncertainty and imprecision, and confidence intervals should be interpreted with caution. Under conditions of sparse data, odds ratio estimates may be inflated and do not necessarily reflect stable population-level effects.
Second, the study cohort size, while sufficient to detect allele-level associations, limits the robustness of genotype-level inference. Confirmation of the observed genotype-level enrichment will therefore require replication in larger, independent cohorts and in populations with different genetic backgrounds.
Third, the molecular dynamics simulations and binding free energy analyses were conducted over a finite simulation timescale and were designed for comparative, hypothesis-generating purposes. These in silico analyses do not provide direct evidence of pathogenicity, immunogenicity, or causal involvement in disease initiation, and should be interpreted as supportive structural and energetic observations rather than definitive mechanistic proof.
Finally, the proposed Vector-derived Cadherin Mimicry (VCM) framework is conceptual in nature. While it integrates immunogenetic associations with structural modeling, experimental validation—using appropriate cellular or animal models—will be necessary to assess the biological relevance of this hypothesis and its potential contribution to pemphigus vulgaris pathogenesis.
In conclusion, while vector-associated molecular mimicry has been most extensively explored in the context of endemic pemphigus foliaceus, the present study proposes a computationally informed and structurally plausible extension of this conceptual framework to PV, with a shift in focus from DSG1 to DSG3. Rather than establishing a definitive mechanistic link, this hypothesis expands existing models of vector-associated autoimmunity by integrating immunogenetic context with structural bioinformatics. To the best of our knowledge, this study also reports a significant enrichment of the heterozygous HLA-DRB1*04:02/14:01 genotype in a PV cohort, although confirmation of genotype-level effects will require validation in larger, independent populations. In parallel, our analyses suggest a previously underexplored connection between genetic susceptibility and repeated environmental exposure to vector-derived cadherin-like proteins. Through the integration of immunogenetic data, structural modeling, and comparative energetic analyses, we provide preliminary in silico observations consistent with the proposed VCM framework. If supported by future experimental studies, this model could contribute to a broader understanding of autoimmune pathogenesis, in which hematophagous insect vectors may represent context-dependent environmental factors rather than direct causal agents. Such insights may ultimately inform HLA-based risk stratification approaches and guide the development of targeted preventive strategies in genetically susceptible populations.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Amagai M Stanley JR . Desmoglein as a Target in Skin Disease and Beyond. J Invest Dermatol. (2012) 132:776–84. doi: 10.1038/jid.2011.390, PMID: 22189787 PMC 3279627 · doi ↗ · pubmed ↗
- 2Stanley JR Amagai M . Pemphigus, Bullous Impetigo, and the Staphylococcal Scalded-Skin Syndrome. N Engl J Med. (2006) 355:1800–10. doi: 10.1056/NEJ Mra 061111, PMID: 17065642 · doi ↗ · pubmed ↗
- 3Amagai M . The molecular logic of pemphigus and impetigo: the desmoglein story. Vet Dermatol. (2009) 20:308–12. doi: 10.1111/j.1365-3164.2009.00831.x, PMID: 20178466 · doi ↗ · pubmed ↗
- 4Payne AS Hanakawa Y Amagai M Stanley JR . Desmosomes and disease: pemphigus and bullous impetigo. Curr Opin Cell Biol. (2004) 16:536–43. doi: 10.1016/j.ceb.2004.07.006, PMID: 15363804 · doi ↗ · pubmed ↗
- 5Moro F Sinagra JLM Salemme A Fania L Mariotti F Pira A . Pemphigus: trigger and predisposing factors. Front Med. (2023) 10:1326359. doi: 10.3389/fmed.2023.1326359, PMID: 38213911 PMC 10783816 · doi ↗ · pubmed ↗
- 6Didona D Di Zenzo G . Humoral Epitope Spreading in Autoimmune Bullous Diseases. Front Immunol. (2018) 9:779. doi: 10.3389/fimmu.2018.00779, PMID: 29719538 PMC 5913575 · doi ↗ · pubmed ↗
- 7Tangye SG Ma CS Brink R Deenick EK . The good, the bad and the ugly — TFH cells in human health and disease. Nat Rev Immunol. (2013) 13:412–26. doi: 10.1038/nri 3447, PMID: 23681096 · doi ↗ · pubmed ↗
- 8Vanderlugt CL Miller SD . Epitope spreading in immune-mediated diseases: implications for immunotherapy. Nat Rev Immunol. (2002) 2:85–95. doi: 10.1038/nri 724, PMID: 11910899 · doi ↗ · pubmed ↗
