singIST: An integrative method for comparative single-cell transcriptomics between disease models and humans
Aitor Moruno-Cuenca, Sergio Picart-Armada, Rachael Bogle, Jennifer Fox, Lam C. Tsoi, Johann Eli Gudjonsson, Alexandre Perera-Lluna, Francesc Fernández-Albert

TL;DR
singIST is a new method that compares single-cell gene activity in disease models to human data, helping researchers understand how well models mimic human diseases.
Contribution
singIST introduces an explainable framework for comparing single-cell transcriptomics between disease models and human conditions at multiple biological levels.
Findings
singIST identifies which models best mimic human disease at pathway, cell type, and gene levels.
In atopic dermatitis models, singIST recovers known biology and highlights model-specific divergences.
T-cell stimulation in hidradenitis suppurativa models improves only pre-existing well-matched pathways.
Abstract
Disease models are fundamental tools in drug discovery and early-stage drug development, but they only approximate human disease, and selecting a suitable model is challenging. Quantitative computational methods exist to assess molecular resemblance to human conditions, but approaching that work at single-cell resolution, and doing so in an explainable and generalizable way, remain very limited. We present singIST, a computational method for comparative single-cell transcriptomics analysis between disease models and human conditions. singIST provides explainable quantitative measures on disease model similarity to the human reference at the pathway, cell type and gene levels. These measures jointly account for gene orthology, cell type presence in the model, cell type and gene importance in the human condition, and gene level fold changes in the model, within a unifying framework that…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Fig 1
Fig 2
Fig 3
Fig 4
Fig 5
Fig 6
Fig 7- —http://dx.doi.org/10.13039/501100010198Ministerio de Asuntos Económicos y Transformación Digital, Gobierno de España
- —http://dx.doi.org/10.13039/501100010198Ministerio de Asuntos Económicos y Transformación Digital, Gobierno de España
- —Doctorats Industrials Agència de Gestió d'Ajuts i Universitaris i de Recerca
- —NIH
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHidradenitis Suppurativa and Treatments · Psoriasis: Treatment and Pathogenesis · Acne and Rosacea Treatments and Effects
1. Introduction
Disease models are biological experimental systems to study human disease. These models are designed to mimic the pathophysiology, progression, and response to treatments observed in human conditions. These models serve as the backbone to drug discovery and early drug development activities; drug target validation and characterization [1]; compound screening [2,3]; preclinical studies to identify a lead candidate from several targets, select optimal formulation, posology and route of administration [4]; and guide early phase clinical trial design [5,6]. However, the validation of molecular physiology, aetiology and pathogenesis of disease models to that of human condition remain a challenge, contributing to high rates of drug development attrition [7].
Recently, there have been methodological advancements in bioinformatics to quantitatively assess the validity of disease models in mimicking a human condition, through bulk transcriptomics. Found In Translation (FIT) [8] is a statistical methodology, relying on regularized linear regression models, that leverages bulk transcriptomics data to extrapolate murine disease models’ gene expression to expression changes that would be equivalent in the human condition, by using disease models’ Fold Changes (FC). Another approach is In Silico Treatment (IST) [9], a computational method that assesses translation of disease-related bulk gene expression patterns between animal models and humans, by also simulating observed disease models’ FC in humans, providing an interpretable measure of their transcriptomics similarity. Nonetheless, evaluating disease models using bulk transcriptomics methods may lack the necessary granularity to underpin changes in specific cell populations involved in the pathological manifestations of the human condition. This is particularly true for Immune-mediated inflammatory diseases (IMIDs), whose pathogenesis is primarily driven by lymphoid cells [10,11]. Neither FIT nor IST provides a trivial approach to accommodate single-cell data. Current methodologies for comparative analysis of single-cell transcriptomic changes in disease models to that of human condition are scarce. A recurrent approach is to perform an Overlapping Differentially Expressed Genes (ODEGs) analysis between disease models and human condition [12–14], yet ODEGs have been proven to be suboptimal as this analysis treats every gene direction and magnitude as equal importance posing the need for more sophisticated approaches [15]. Another strategy is performing a dimensionality reduction technique (CCA, NNMF, tSNE) on disease models’ and human scRNA-seq data and comparing the obtained latent factors [16–18], which poses difficulty in interpreting and quantifying the similarity between both.
To address the challenges in single-cell transcriptomics analysis, we introduce singIST, a flexible computational method built on the foundation of IST. singIST facilitates comparative analysis between disease models and human conditions by accounting for orthology, cell type agreement, adaptive sparsity, and the importance of genes and cell types. It provides interpretable measures of transcriptomic similarity at different levels of granularity. We demonstrate the potential of singIST first in three well-characterized murine models against Atopic Dermatitis (AD), and then in a second use case: Hidradenitis Suppurativa (HS) skin explant cultures ex vivo with and without CD3/CD28 stimulation. In addition, we performed two simulation studies to verify the implementation and properties, and to compare singIST against a naïve ODEGs based approach.
2. Materials and methods
2.1 Materials
2.1.1 Atopic Dermatitis: Evaluation of mouse models.
Moderate-to-severe AD patients single-cell RNA-seq data were obtained from [19], including 4 healthy control (HC) and 5 AD skin suction-blister samples. Cell types followed the original annotations (T cells, melanocytes, dendritic cells, Langerhans cells, and keratinocytes).
Three epicutaneously sensitized murine models that develop an AD-like eczematous phenotype were analysed: Oxazolone (OXA) and Imiquimod 5% cream (IMQ) from [20], and Ovalbumin (OVA) from [21]. For each model and its respective controls, three ear-skin biopsy replicates were available. Details on preprocessing, pseudobulk construction and cell-type harmonization across species, as well as GEO accession numbers and sample metadata, are provided in S1, S2, and S4 Files.
2.1.2 Hidradenitis Suppurativa: Evaluation of human skin explants ex vivo.
Moderate-to-severe HS patient single-cell RNA-seq data were obtained from a previously published study of lesional and healthy control skin biopsies (8 HS and 8 HC biopsies; [22]). For singIST, we restricted the analysis to immune cell populations (myeloid cells, T-cells and mast cells). A detailed description of clinical characteristics, sequencing and preprocessing for this dataset is provided in S1 File.
In addition, we generated an ex vivo HS explant dataset. Six-millimetre punch biopsies from HS lesional skin and healthy control skin were allocated to three experimental groups: healthy control cultured with DMSO (HC DMSO), HS cultured with DMSO (HS DMSO), HS cultured with DMSO plus anti-CD3/CD28 antibodies (HS CD3/CD28). Each group included three independent biological replicates, and all biopsies were cultured for 60 hours before processing. Single-cell RNA-seq libraries were processed using the same pipeline as for the published HS dataset, and only myeloid cells, T-cells and mast cells were retained for analysis. Experimental details, preprocessing steps and sample-level information are described in S1 File.
2.1.3 Pathway data.
For AD, we selected the pathways from [23] that were significantly enriched in moderate-to-severe AD versus HC, and retrieved the corresponding curated gene sets from MsigDB v7.5. For HS, pathways were selected based on GSEA enrichment in human HS versus HC skin. Database sources are provided in S1 File.
2.2 Methods
2.2.1 singIST method.
The objective of this section is to introduce singIST framework for quantifying the agreement in single-cell transcriptomic changes between a human reference and a disease model. A summary of notation used throughout this section is shown in Table 1, and a graphical summary of the full singIST procedure is shown in Fig 2.
Table 1: Summary table of notations defined in singIST method. S1 File has further details on dimensions.
Representation of asmbPLS-DA for scRNA-seq readouts.The response vector Y contains samples as rows, one-hot encoded. Predictor blocks Cb are defined by cell types, with columns representing genes and rows representing samples. Each element within these predictor blocks is the pseudobulk of gene expression values. The figure displays predictor blocks ωgjb and the predictor superblock ωbjsuper weights, as well as scores for the response matrix uij, predictor blocks tijb, and the superblock tijsuper. Created in BioRender. Moruno Cuenca, A. (2026) https://BioRender.com/l36t9i6.
We start by defining the three inputs of singIST: superpathways, human scRNA-seq, and disease model scRNA-seq . First, we define the concept of a superpathway as a set containing cell types and genes. We name as the cell types of interest, previously identified and annotated. For each superpathway , there is a gene set extracted from a pathway of interest p, from which gene subsets are derived for the cell types where . Each superpathway is formally defined as , with the complete set of superpathways representing all pathways under evaluation. singIST method runs independently for each of the superpathways; hence, without loss of generality, from now on we fix a superpathway .
Second, we structure the human scRNA-seq data according to the superpathway, after batch correction and normalization using standard workflows. Let be the block of matrices containing the pseudobulk log-normalized expression for each cell type. Each matrix is defined element-wise , where is the pseudobulk gene expression of human sample i for gene g in cell type b. We define the target class as the human experimental group that the disease model is intended to mimic (i.e., disease), and the base class as the human experimental group that should be differentiated from the target class (i.e., healthy control). Let be the response binary vector denoting human sample class, defined element-wise , where elements
Third, we assume there are disease models to be assessed against human scRNA-seq data for each superpathway. Since singIST runs independently for each disease model, without loss of generality we fix a disease model l. We structure the disease model scRNA-seq FC as blocks of vectors. Let be the block of vector containing the FC between target class and base class of disease model samples. Each vector is defined element-wise , where denotes the human gene subset with its equivalent gene organism symbols for the disease model. The are computed through Eq (1).
Where (difference of mean expression; i.e., ratio of geometric means of counts+1) and are the disease model and adjusted p-value, respectively, of target class versus base class. Details on the computation are in S1 File.
With the three inputs required for singIST now defined, we proceed to outline the steps of the method. The first step is to quantify the transcriptomic changes in the human reference for the superpathway, which comprises gene sets grouped by cell types. To accomplish this, we employ adaptive sparse multi-block partial least squares discriminant analysis (asmbPLS-DA) as the regression model, that performs feature selection in high-dimensional omics data using a multi-block structure [24]. A detailed description of asmbPLS-DA is provided in S1 File.
asmbPLS-DA is trained on human scRNA-seq as the response and as the block variable, both centred and scaled, with base class serving as the reference. The model structure is illustrated in Fig 1.
Using the fitted asmbPLS-DA, each human sample receives a superpathway score – the continuous predicted response. The reference shift for the superpathway is the median difference between target and base classes (Eq. (2)), which quantifies how strongly the superpathway separates human classes. We call this measure superpathway reference recapitulation.
We show that admits an additive decomposition both by cell type and by gene (Eq. (3)). The full derivation is provided in S1 File.
This result justifies defining cell type level human reference shift as the median difference of cell type contribution ( ) between target and base classes, which quantifies how strongly cell type b separates human classes. We call this measure cell type b reference recapitulation.
The gene contributions serve to attribute the cell type effect hierarchically – i.e., within each cell type b, genes explain additively and are later used for driver analyses of these shifts. In short, these reference recapitulation constructions provide a hierarchical quantification of the difference between human target and base classes – from superpathway ( ) to cell type ( ), with genes used for attribution.
In the second step of singIST, we ask a counterfactual question that links the disease model to the human reference data: if the human base class samples changed exactly as observed in the disease model, would the fitted asmbPLS-DA judge them more target class-like? However, directly comparing the disease model to human scores is not meaningful because they live on different scales and may differ by organism. We therefore create a commensurate yardstick by injecting the disease model’s changes into human reference data – while holding the fitted asmbPLS-DA fixed. We call these samples singIST treated samples and we construct them by applying the disease model to human expression base class samples only where a mapped cell type exists and there is a one-to-one ortholog; otherwise values are left unchanged (Eq. (5)).
Two biologically plausible scenarios define the transformation in Eq (5); ( ) the case where either cell type b does not exist in the disease model ( ) or disease model does not have a one-to-one ortholog of human gene g ( ); ( ) the case where both cell type b and a one-to-one ortholog gene of g exist in disease model.
We now turn to the third step of singIST. Having constructed the singIST treated predictors in step 2, we evaluate the counterfactual with the same fitted asmbPLS-DA from step 1 (identical loadings, scaling, and reference). Using the same derivations as in step 1, we summarize the disease model-induced shift relative to the base class human data.
Which we call the superpathway predicted recapitulation and cell type b predicted recapitulation, respectively. These two measures quantify how far the singIST treated samples move toward the target class in the human reference space when judged by the human fitted asmbPLS-DA model.
In the last step, we report predicted recapitulations as percentage of the human reference recapitulations.
To attribute cell type recapitulation to genes, we define the signed percent contribution of gene :
The increment does not depend on i; a proof is provided in S1 File. By construction, , and therefore partitions additively into gene level drivers. Genes without a mapped cell type, without a one-to-one ortholog, or with non-significant model contribute zero by Eq. (5).
2.2.2 Interpretation of recapitulation measures.
Recapitulation quantifies the concordance in both direction and magnitude of single-cell transcriptomic changes between the human reference and the disease model. A recapitulation score of indicates perfect agreement in direction and magnitude at the superpathway level, whereas a score of denotes an opposite direction but perfect agreement in magnitude. The interpretation for cell type recapitulation remains unchanged. Recapitulation is proportional to one-to-one orthology coverage, the number of mapped cell types, and the proportion of differentially expressed genes in the disease model. Within each cell type recapitulation , gene contributions indicate the direction and magnitude of each gene’s impact on the cell type recapitulation. Contributions are zero when the log2FC is non-significant, the gene lacks one-to-one orthology, or the cell type is unmapped. These properties are formally demonstrated in S1 File, and a comprehensive simulation study of recapitulation behaviour is presented in S5 File.
2.3 Validation methodology
2.3.1 Validity test of the optimal asmbPLS-DA.
Once the optimal model is selected, the validity of such for classifying between target class and base class is checked by a permutation test [25] adapted to small sample size and asmbPLS-DA. A null model distribution is generated by permuting Y, noted as , and setting the number of permutations. For each an asmbPLS-DA model is fitted with and as the optimal number of PLS components and quantile combination for each block and PLS component, respectively, and randomly taking one sample out to avoid overfitting. With the permuted model, Y is predicted for all samples under analysis, such prediction is compared against the true Y by F1 score. The rationale behind randomly permuting each Y element is that the original relationship of the model is disrupted while the dependence structure of is preserved [26], thus providing a control of a false positive model. If the optimal model is actually significant, it is expected that error measures increase substantially when permuting. To this end, the F1 LOOCV error of optimal model is compared against the of null distribution of F1 score, where is the confidence threshold and the quantile serves as the p-value which is adjusted for multiple comparison by Benjamini-Hochberg [27]; FDR is set to 0.1.
2.3.2 Parameter variabilities and significance of the optimal asmbPLS-DA model.
Cell type and gene importance, within a cell type b, may be assessed by considering the weighted average of its estimated coefficient, by taking as weights the relative importance of each PLS component qj [28]. To this end, we define the Cell Importance Projection (CIP) for cell type b:
Similarly we define the Gene Importance Projection (GIP) for gene g within cell type b:
Both indices verify ; this is proven in S1 File. The direction of CIP^b^ may be assessed by , and equivalently for . Note that CIP^b^ distribution is nested to the already estimated , the blocks with only a small number of relevant genes will assign higher values, being a cell type a cell type that does not contain any relevant information in classifying between target class and base class.
The distribution of a gene that is significant is likely to substantially differ from its associated null H0 distribution. A null distribution of of the form , and is generated by permuting all samples and genes within blocks. Note that permuting between blocks would not satisfy exchangeability assumption as distribution is dependent on . The median of distribution of the optimal model is compared against the null distribution by a Mann-Whitney U test, with the alternative hypothesis being the median greater than median of . P-value is adjusted by Bonferroni correction with a lower bound of expected number of true null hypothesis for each cell type ; the rationale is provided in S1 File.
3. Results
3.1 Atopic Dermatitis: Evaluation of mouse models
3.1.1 Training on human data and reference shifts.
Training the human reference on AD versus healthy yielded clear superpathway separation (FDR ), with gene set sizes spanning from 15 genes (CD40/CD40L signalling [PID]) to 701 (Cytokine signalling in the immune system [REACTOME]) (Table A in S3 File).
Using the CIP measure, each superpathway exhibited a distinct ranking of cell types that most strongly drive the AD-healthy prediction. For Dendritic Cells in Th1/Th2 Development [BIOCARTA], T-cells had the highest CIP, with secondary importance in melanocytes and dendritic cells, while keratinocytes and Langerhans cells ranked lower. JAK-STAT signalling pathway [KEGG] displayed diffuse CIP profile (similar importance across cell types), consistent with broad activation. Chemokine receptors bind chemokines [REACTOME] was led by keratinocytes and dendritic cells, whereas Cytokine-cytokine receptor interaction [KEGG] was driven primarily by antigen-presenting cells (APCs). Full CIP ranks, sparsity parameters, and the number of significant genes per cell type are reported in Table A in S3 File.
Gene level signals align with canonical AD biology and added cell type specific detail, with explicit literature cross-references in Table C in S3 file. In Dendritic Cells in Th1/Th2 Development [BIOCARTA], T-cells showed upregulation of IL13, IL5, and CSF2 with TLR7 down, and APCs highlighted ANPEP. In JAK-STAT signalling pathway [KEGG], T-cell interleukins (e.g., IL13, IL26, IL2RA, IL7) and interferon-axis components were prominent; keratinocytes showed IL15/IL15RA; dendritic cells emphasized SPRED1, SOCS1, and OSM; and melanocytes exhibited concurrent upregulation of CCND3 and CCND1, consistent with proliferative JAK-STAT activity (see S2 File for literature references). Per superpathway and cell type top-give genes are listed in Table B in S3 File.
3.1.2 Recapitulation across Oxazolone, Imiquimod, and Ovalbumine mouse models.
Fig 3 summarizes per-model one-to-one orthology to human and the corresponding superpathway recapitulation estimated by singIST. All three are mouse models; observed one-to-one orthology levels are broadly similar, but OVA shows lower observed orthology because fewer genes were sequenced per pathway. Coverage varies by pathway – for example, Asthma and Chemokine receptors bind chemokines are 60%, whereas IL12 signalling events mediated by STAT4 reach 100% (Fig A in S3 File).
INPUT) First, definition of a superpathway.(𝒫p) as a set containing cell types and genes. For each 𝒫p, there is a gene set 𝒢p from which gene subsets are derived for cell types {𝒢pb}b⊆𝒢p. Second, for each 𝒫p human scRNA-seq data are organized into matrix layers. Target class is the human experimental group that the disease model aims to mimic (i.e., disease), while base class is such that it should be differentiated from target class (i.e., healthy control). Third, for each 𝒫p disease models scRNA-seq log2FC are structured into vector layers. singIST METHOD) The method is organized into four steps, which run independently for each 𝒫p and disease model. Step 1) Objective: Quantify differences between target and base class human samples at various levels of granularity (superpathway, cell type, and gene) using asmbPLS-DA. Input: A 𝒫p and human scRNA-seq data. Output: Optimal asmbPLS-DA. From such, we derive cell type contributions (γ^ib) and gene contribution (δ^igb). With the contributions we compute similarity measures at the superpathway (Ω^) and the cell type levels (Γ^b). Step 2) Objective: Biologically unify the human data with the disease model data for subsequent comparison. Input: Human scRNA-seq base class samples and disease model scRNA-seq log2FC data. Output: Human scRNA-seq gene expression observed when disease model log2FC are applied, we call them singIST treated samples. The former is achieved in the Biological link function, which performs steps; one-to-one orthologous mapping; cell type alignment; translate log2FC to xigb. Step 3) Objective: Compute metrics of output from Step 1 between singIST treated samples and human scRNA-seq base class. Input: singIST treated samples, Human scRNA-seq base class samples and optimal asmbPLS-DA. Output: Pathway predicted recapitulation (Ω^′), Cell type b predicted recapitulation (Γ^b) and predicted gene contributions (δ^igb′). Step 4) Objective: Compute similarity metrics between human and disease model. Input: From step 1; Ω^ and Γ^b, and δ^igb. From step 3; Ω^′ and Γ^b′, and δ^igb′. Output: Predicted recapitulations as a fraction of the reference recapitulations (Ω^f, Γ^fb). Γ^fb is explained by contributing genes (Δ^fgb), providing interpretation on which genes drive the cell type recapitulation. OUTPUT) Ω^f and Γ^fb are displayed. Positive values show agreement in gene expression change between disease model and humans; negative values show opposition. Each Γ^fb equals the sum of its gene contributions Δ^fgb. Fig 2.
Superpathway recapitulation and observed one-to-one orthology of AD disease models.A) Superpathway predicted recapitulation as a fraction of the superpathway reference recapitulation for IMQ, OXA and OVA across all pathways under study. Negative recapitulations refer to opposed directions with human observed condition, while positive recapitulations define agreement in direction. B) Observed one-to-one orthology coverage refers to number of observed and one-to-one ortholog genes in disease model as a fraction of pathway gene set size. Despite the fact that disease models belong to the same organism mus musculus their differences in observed orthology one-to-one coverage come from sequenced reads.
Superpathway recapitulation for Dendritic Cells in Th1/Th2 Development was moderate for IMQ (27.7%) and OXA (16.8%), and null for OVA (0%). Cell type recapitulations are displayed in Fig 4, which shows moderate T-cell recapitulation for IMQ (54.6%) and no recapitulation for OVA (0%) and OXA (-1%). In OXA, the signal is instead carried by Langerhans cells (98.5%) and dendritic cells (35.6%). OVA has 0% for Langerhans cells across superpathways because that population was removed (<100 cells). Melanocytes are 0% in all models and superpathways (unmapped cell type). Fig 5 shows gene contributions to cell type recapitulation. In IMQ, T-cell recapitulation is driven by IL5 (95.1%) and CD7 (16.7%), whereas TLR7 contributes negatively (-57.4%) because it is suppressed in human AD but upregulated in IMQ. OVA has no DEGs in this pathway and therefore 0% recapitulation. Notably, the key T-cell marker IL13 in AD does not contribute positively in any model. JAK-STAT signalling pathway [KEGG] superpathway recapitulation was highest for OXA (81.2%), while IMQ (-114.9%) was opposite in direction and OVA (0.1%) was essentially null. OXA agreed in direction across all cell types except dendritic cells, with varying magnitudes. The negative score for IMQ was driven by strong discordance in Langerhans cells (-197.5%) and dendritic cells (-488.1%); gene-level contributions point to IL12RB2 in dendritic cells (-399%) and IL2RB in Langerhans cells (-224.2%). Cytokine-cytokine receptor interaction [KEGG] recapitulation was strongly negative for all models, with OVA . The disagreement was consistent across cell types. Gene-level disagreement drivers included multiple chemokine receptors (CCR2, CCR3) and ligands (CCL5, CCL24, CCL17, CXCL10, CXCL6). Chemokine receptors bind chemokines [KEGG] recapitulation was high for OVA (343.4%), moderate for IMQ (47.1%) and negative for OXA (-120.3%). The OVA signal is heavily concentrated in keratinocyte cell type, just two genes - CCL7 (179.9%) and CXCL6 (777.6%) - which largely accounts for its extreme value.
Cell type recapitulation for all AD disease models and superpathways under analysis.
Gene contribution and disease model estimated.rgb.A) Gene contribution to cell type recapitulation by disease model. If gene set size of pathway is greater than 50, only the top 5 contributing genes, for each cell type, were displayed. Blank gene contributions correspond to 0 values. B) Computed rgb by disease model. Grey log2FC refer to genes without one-to-one ortholog and/or not sequenced in disease model. Framed log2FC refer to statistically significant FDR≤0.05 genes, as per FindMarkers. Blank log2FC correspond to 0 values.
3.2 Hidradenitis Suppurativa: Evaluation of human skin explants ex vivo
3.2.1 Training on human data and reference shifts.
We first trained the human reference model on lesional HS skin versus healthy control skin. All superpathways for HS corresponded to immune and inflammatory infiltration of the dermis (T-cell activation/signalling, chemokine–cytokine signalling, myeloid programmes), and all of them were statistically significant under the permutation test (FDR ≤ 0.1), confirming that the disease and control separation is not due to chance (Fig D in S3 File).
Cell type importance projection (CIP) showed two recurrent patterns (Figs E and F in S3 File). A first group of superpathways was predominantly T-cell driven, with T-cells ranking clearly highest and myeloid cells contributing little. A second group showed a joint T-cell and myeloid signature, in which T-cells remained the top contributor but myeloid cells also carried a sizable part of the HS signal. Mast cells were not predictive in any superpathway, consistently appearing with negligible CIP. These superpathways define the human HS reference shifts that we later use to evaluate ex vivo explants with and without CD3/CD28 stimulation.
3.2.2 Recapitulation with and without CD3/CD28 stimulation in explants.
Since both organisms are human, observed one-to-one orthology was uniformly high (mostly >90%) for HS explants and controls; the small differences between disease models were due to genes not detected in the explants rather than to mapping issues (Fig G in S3 File).
In the unstimulated explants (HS in DMSO vs. Healthy in DMSO), singIST already identified a group of superpathways that reproduced the human HS signal (Fig 6A). These corresponded mainly to T-cell driven pathways and to mixed T-cell/myeloid pathways associated with dermal inflammation (Fig H in S3 File). When the same explants were stimulated with CD3/CD28, these pathways were the ones that improved: their superpathway recapitulation increased, and the gain was also visible in the T-cell and myeloid components and in the gene-level contributions (Fig 6A; Figs I and J in S3 File). In other words, stimulation amplified the pathways that were already aligned with the human reference.
Superpathway recapitulation of HS disease models.A) Superpathway recapitulation of well-recapitulated superpathways in HS disease models (HS Unstim vs. HC Unstim and HS Stim vs. HC Unstim). B) Superpathway recapitulation of bad-recapitulated superpathways in HS disease models.
By contrast, the superpathways that were not well recapitulated in the unstimulated condition did not become more human-like after stimulation (Fig 6B). This was especially clear for pathways whose human signal was predominantly a combination of T-cell and myeloid (Fig H in S3 File) or where explants showed an opposite direction in myeloid cells: even after CD3/CD28, cell-type recapitulation remained low or fragmented (Fig 7; Fig K in S3 File). The only clear exception was NF-κB signalling, which is T-cell driven in vivo but was still poorly reproduced ex vivo. This pattern is consistent with the heatmaps: stimulated and unstimulated explants display very similar transcriptomic profiles, and stimulation mainly produces a stronger version of the same response, rather than introducing a new one (Fig J and K in S3 File). At the same time, stimulation does what we would expect biologically in T-cells: ZAP70 activation marker that was absent in the unstimulated cultures appears after CD3/CD28, becoming a positive contributor to the Th17 cell differentiation [WP] superpathway, i.e., exactly in one of the pathways that improved under stimulation.
Cell type recapitulation for bad-recapitulated HS disease models.
An independent analysis of cell type composition showed that myeloid cells are expanded in vivo HS lesions but reduced in the unstimulated explants (Fig L in S3 File), which explains why myeloid driven HS pathways remain poorly recapitulated in the explants.
4. Discussion
Atopic Dermatitis (AD) represents a chronic skin-immune-mediated inflammatory disease (IMID), characterized by dysregulated T-cell mediated inflammation and keratinocyte differentiation [29]. We put a special focus on the discussion on pathways proven to be causal drivers of AD pathogenesis or related to its clinical severity; JAK-STAT signalling pathway, Dendritic Cells in regulating Th1/Th2 development, Cytokine-Cytokine receptor interaction, and Chemokine signalling pathway. Of the four AD pathways we examined, JAK–STAT signalling [KEGG] was the only one showing broad activation across cell types (TC, KC, MC, and APC), whereas the other pathways were activated in a more cell type–restricted manner. This is compatible with lesion associated inflammatory signalling in AD skin and with the concurrent upregulation of cell-cycle and survival genes reported in skin lesions, including the simultaneous activation of CCND3/Cyclin-D3 and CCND1/Cyclin-D1 in MC [30], as well as genes in the PI3K–AKT axis such as PIK3 CD and PIK3CB, and the upregulation of IL15 and IL15RA/IL15R in KC during inflammation.
It was not unexpected to observe few DEGs in OVA for the pathways analysed. This agrees with the low DEG counts in TC and DC reported by [21] and with bulk-RNA studies showing limited differential expression in AD-relevant pathways in this model [31]. The absence of MC in all three mouse models is expected, because in C57BL/6J and BALB/c pelage — including the ear, where biopsies were collected — largely lacks functional pigment producing melanocytes [32].
The Dendritic cells in Th1/Th2 development [BIOCARTA] pathway was the one with disease model and human mismatch. In IMQ, TLR7 is upregulated in T-cells, but in human AD it is suppressed, so TLR7 enters singIST with a negative contribution. This fits biology: TLR7 signalling drives Th1/Th17 responses [33], whereas its suppression favours the Th2 profile seen in AD [34]. A second mismatch is that none of the three mouse models induces IL13/IL4 through Th2 cells: IMQ shifts to IL17/IL22, and OVA/OXA produce IL13/IL4 mainly through infiltrating basophils/myeloid cells rather than Th2 cells [20,21]. Consequently, classical Th2 genes contribute little or even slightly negatively (e.g., IL13 in OXA with -5.5% contribution in T-cells).
In JAK–STAT signalling [KEGG], OXA showed the highest human-like shift (81.2%), with consistent agreement across cell types except dendritic cells, which were opposed in IMQ/OXA models. OVA was essentially null (0.1%), and IMQ was the opposite (−114.9%). This pattern is consistent with OXA being the murine model most often used for JAK-inhibitor studies in AD [35] and with bulk RNA data favouring OXA over OVA for this pathway [31]. One source of disagreement is STAT4, which is suppressed during Th2 development in human AD [36] but upregulated in IMQ and OXA, reflecting their non-Th2-skewed responses, and thus STAT4 contributing negatively to T-cell recapitulations.
Chemokine receptors bind chemokines [REACTOME] and Cytokine–cytokine receptor interaction [KEGG] share several high contributing genes. Many of these chemokines are stage-dependent in AD lesions. In our data, CCL5 (from the chemokine receptor pathway) showed large negative contributions in TC for both IMQ and OXA (and in DC for IMQ), whereas acute AD lesions are known to upregulate CCL5, while chronic lesions are known to suppress it [37]. Since the human samples [19] are likely chronic and the murine lesions are acute, this stage mismatch potentially explains the opposite directions across models and cell types.
In the HS application, singIST showed very consistent behaviour: pathways that already recapitulated the human HS signal in unstimulated explants were the ones that improved after CD3/CD28, whereas poorly recapitulated pathways stayed poor.
For the well recapitulated pathways, CD3/CD28 produced the expected boost: superpathway scores increased, the same cell types gained resemblance, and the same genes became stronger contributors (Fig 6A; Fig H and J in S3 File). This is the pattern we would expect after CD3/CD28, and it also shows that the stimulation was effective: ZAP70, an early marker of T-cell activation [38], appears in T-cells only in the stimulated explants in Th17 cell differentiation pathway [WP]. This means that the fact that most transcriptomic profiles remain very similar between unstimulated and stimulated conditions is not due to insufficient stimulation, but to the biology of the explant system (i.e., it can boost the T-cell component, but it cannot rebuild the full missing signals). By contrast, myeloid dependent HS pathways did not improve with stimulation (Fig 6B). Even after CD3/CD28, explants did not recover the human like myeloid signal, and at the cell-type level recapitulation remained fragmented (Fig 7; Fig K in S3 File). This agrees with reports showing that tissue-resident macrophages and microglia rapidly lose their tissue-imprinted transcriptional programme once they are placed in culture and only re-acquire it in the appropriate in vivo niche or under specific stimuli [39–41]. Our independent cell type composition analysis points in the same direction: in vivo HS skin showed an expansion of myeloid cells, whereas the unstimulated explants in culture showed a significant reduction (Fig L in S3 File). In other words, the culture condition preserves T-cell responsiveness to stimulation, but does not preserve the tissue-conditioned myeloid cells that drive part of the HS signature. Taken together, this supports a simple claim: the CD3/CD28-stimulated HS explant is the most appropriate model for those HS pathways that already recapitulate in the unstimulated condition, but not for myeloid driven HS pathways that do not.
While demonstrating significant capabilities, singIST presents several limitations, including dependence on pre-annotated cell types, the assumption of homogeneous effects when translating fold changes to human gene expression, and the requirement for well-defined human disease states (e.g., endotypes) prior to analysis. Additionally, extensions could be explored on differentiating changes due to cell type-specific gene expression and cell type proportions, between human classes. Further validation in additional disease contexts will solidify its utility in drug development and preclinical research.
5. Conclusions
Here we developed singIST, a computational framework for comparative single-cell transcriptomics that provides an integrative, explainable way to evaluate disease model similarity to human conditions at pathway, cell type and gene levels. Its application to murine models against Atopic Dermatitis and to human Hidradenitis Suppurativa skin explants (with and without CD3/CD28 stimulation), together with supporting simulation studies, shows that singIST can both recover known biology and pinpoint where models succeed or fail to recapitulate human disease.
Supporting information
S1 FileMaterials and methods.Detailed description of methodological procedures, model formulation, and data processing steps.(PDF)
S2 FileSample metadata.Metadata for all human and mouse samples, including demographics, experimental conditions, sequencing protocol, and GEO accession numbers.(PDF)
S3 FileExtended results.Supplementary tables and figures of Results section.(PDF)
S4 FileCell type mapping.Overview of the cell type harmonization between human data and mouse models, including cluster selection and mapping criteria.(PDF)
S5 FileSimulation study.Simulation studies to validate singIST recapitulation properties and benchmark against ODEGs.(PDF)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Emmerich CH, Gamboa LM, Hofmann MCJ, Bonin-Andresen M, Arbach O, Schendel P, et al. Improving target assessment in biomedical research: the GOT-IT recommendations. Nat Rev Drug Discov. 2021;20(1):64–81. doi: 10.1038/s 41573-020-0087-3 33199880 PMC 7667479 · doi ↗ · pubmed ↗
- 2Elitt MS, Barbar L, Tesar PJ. Drug screening for human genetic diseases using i PSC models. Hum Mol Genet. 2018;27(R 2):R 89–98. doi: 10.1093/hmg/ddy 186 29771306 PMC 6061782 · doi ↗ · pubmed ↗
- 3Wei F, Wang S, Gou X. A review for cell-based screening methods in drug discovery. Biophys Rep. 2021;7(6):504–16. doi: 10.52601/bpr.2021.210042 37288368 PMC 10210057 · doi ↗ · pubmed ↗
- 4Shegokar R. Chapter 2 Preclinical testing understanding the basics first. In: Shegokar R, editor. Drug delivery aspects. Elsevier; 2020. p. 19–32.
- 5Steinmetz KL, Spack EG. The basics of preclinical drug development for neurodegenerative disease indications. BMC Neurol. 2009;9 Suppl 1(Suppl 1):S 2. doi: 10.1186/1471-2377-9-S 1-S 2 19534731 PMC 2697630 · doi ↗ · pubmed ↗
- 6Loewa A, Feng JJ, Hedtrich S. Human disease models in drug development. Nat Rev Bioeng. 2023:1–15. doi: 10.1038/s 44222-023-00063-3 37359774 PMC 10173243 · doi ↗ · pubmed ↗
- 7Storey J, Gobbetti T, Olzinski A, Berridge BR. A structured approach to optimizing animal model selection for human translation: the animal model quality assessment. ILAR J. 2021;62(1–2):66–76. doi: 10.1093/ilar/ilac 004 35421235 PMC 9291347 · doi ↗ · pubmed ↗
- 8Normand R, Du W, Briller M, Gaujoux R, Starosvetsky E, Ziv-Kenet A, et al. Found In Translation: a machine learning model for mouse-to-human inference. Nat Methods. 2018;15(12):1067–73. doi: 10.1038/s 41592-018-0214-9 30478323 PMC 12396630 · doi ↗ · pubmed ↗
