Integration of elemental imaging and spatial transcriptomic profiling for proof-of-concept metals-based pathway analysis of colon tumor microenvironment
Aruesha Srivastava, Neha Shaik, Yunrui Lu, Matthew Chan, Alos Diallo, John Zavras, Serin Han, Tracy Punshon, Brian Jackson, Linda Vahdat, Xiaoying Liu, Vivek Mittal, Ken Lau, Jiang Gui, Louis Vaickus, Jack Hoopes, Fred Kolling, Laurent Perreard, Jonathan Marotti, Joshua Levy

TL;DR
This study shows how combining metal imaging with gene data can reveal new insights into how metals influence tumor growth and biological pathways in colon cancer.
Contribution
A novel spatial multimodal workflow integrating elemental imaging and spatial transcriptomics to explore metal-gene interactions in the tumor microenvironment.
Findings
Elevated iron correlates with mesenchymal phenotypes and EMT-related gene expression at the tumor’s proliferative front.
High copper concentrations in active tumor growth regions correlate with increased immune response gene expression.
The workflow demonstrates feasibility for identifying metals-based gene correlates in the tumor microenvironment.
Abstract
The complex interplay between metal abundance, transport mechanisms, cell distribution, and tumor progression-related biological pathways (e.g. metabolism, collagen remodeling) remains poorly understood. Traditionally, genes and metals have been studied in isolation, limiting insights into their interactions. Recent advances in spatial transcriptomics and elemental profiling now enable comprehensive exploration of tissue-wide metal-gene interactions, though integration remains challenging. In this proof-of-concept study, we investigated metal-dependent signaling within the tumor microenvironment of a unique colorectal cancer (CRC) tumor. We implemented a spatial multimodal workflow which integrated elemental imaging, gene expression, cellular composition, and histopathological features to uncover metals-related pathways through spatially resolved gene expression correlation analyses.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4- —NIH10.13039/100000002
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTrace Elements in Health · Molecular Biology Techniques and Applications · Ferroptosis and cancer prognosis
Background and introduction
Colorectal Cancer (CRC) represents a significant global health challenge, accounting for nearly 10% of all cancer cases and ranking as the second-leading cause of cancer-related deaths worldwide [1]. The rising incidence of CRC among younger demographics underscores an urgent need to advance screening, prognostic tools, and therapeutic approaches [2, 3]. Central to improving outcomes is a deeper understanding of the mechanisms underlying tumor progression and metastasis, which are responsible for approximately 90% of cancer mortalities and associated with sharply declining survival rates at advanced stages.
Elements significantly influence cancer progression through roles in cell proliferation, invasion, motility, adhesion, and more [4–6]. Metals such as copper (Cu), iron (Fe), and zinc (Zn) are vital for enzymatic reactions essential for mitochondrial respiration, DNA repair, senescence, and immune regulation [7–18]. Metals also modulate signaling pathways through metalloallostery, influencing nutrient sensing and protein regulation. Elevated levels of Cu and Fe can also contribute to reactive oxygen species (ROS) production, which promotes angiogenesis and disrupts DNA repair, enhancing tumor invasion and metastasis [15, 19, 20]. The interplay between various essential and non-essential elements is integral to tumor growth and metastasis, and research into these interactions and their biological function will provide insights into element-dependent vulnerabilities, offering potential targets for novel therapeutic interventions. For instance, a recent Phase II trial with tetrathiomolybdate (TM), a Cu chelator, demonstrated promising results in improving progression-free and overall survival rates for breast cancer patients at high risk of metastasis [21–23]. TPEN, another Cu chelator, selectively targets CRC cells due to their higher Cu accumulation [12, 17]. Despite advances in metal-based diagnostics and therapies, their clinical application faces significant challenges due to incomplete understanding of how metals are distributed within tumors and their specific roles in intra- and inter-cellular signaling within the tumor microenvironment (TME). The complexity of elemental distribution, influenced by factors such as dietary intake and unique cellular uptake and export mechanisms, results in a dynamic and heterogeneous metal landscape within tumors [24–26]. This complexity is further exacerbated by the presence of metals in various cellular pools, ranging from tightly bound to more labile forms that interact with small molecules [27].
A deeper understanding of metal distribution within tumors has been traditionally limited by bulk analysis methods, which overlook the nuanced spatial distribution of metals crucial for understanding their role in cancer pathogenesis. High-resolution elemental imaging (EI) such as Laser Ablation Inductively Coupled Plasma Time of Flight Mass Spectrometry (LA-ICP-TOF-MS) offers a significant breakthrough by providing precise localization of metal accumulation. This technology enables detailed mapping of metal pools within the tumor landscape, revealing its potential influence on cancer progression and patient outcomes. It holds the potential to revolutionize cancer diagnosis and treatment by identifying unique multi-element signatures indicative of tumor states.
Complementing this, recent technological advances in spatial transcriptomics (ST) enable high-resolution [28], untargeted spatial expression profiling of almost all protein-coding genes, providing insights into the genetic factors governing metal transport and their toxicological implications [29]. The conserved nature of metal transport proteins across species underscores the relevance of these genetic studies. ST overcomes the limitations of bulk analysis, which can obscure tissue-specific relationships, and of multiplexing assays, which are restricted to a limited number of protein candidates [30–38]. By allowing for the profiling of the entire transcriptome at high spatial resolution, including at the single-cell level, ST can potentially be leveraged to pinpoint specific cellular interactions or markers of tumor progression.
The spatial location and abundance of essential elements within tissues reflect complex processes of availability, homeostasis, and biological necessity. For instance, the homeostasis of essential elements involves a myriad of proteins that sense, signal, chaperone, and control their movement [39, 40]. Thus, integrating spatial transcriptomics and elemental imaging technologies has the potential to reveal intricate metal-biomolecular interactions that may be missed by traditional reductionist approaches that would analyze each assay in isolation [41]. Yet, the absence of dedicated software platforms specifically designed for such integrative tasks has been a significant obstacle, explaining, in part, why comprehensive pathway analysis for metals in cancer remains an unrealized goal to date [42–45].
To address this gap, the Biomedical National Elemental Imaging Resource (BNEIR) developed TRACE (Tissue Region Analysis through Co-registration of Elemental Maps), co-registration software that facilitates the spatial integration of elemental imaging data with histopathology, immunohistochemistry/multiplex immunofluorescence, and spatial transcriptomics technologies (Fig. 1) [41, 46, 47]. This exploratory proof-of-concept study builds on TRACE-enabled integration to further characterize elemental and transcriptomic data from the primary tumor of a single colorectal cancer (CRC) case. While this represents the first application of such a workflow in a cancer context, a recent study similarly demonstrated proof-of-concept spatial integration of metal mapping via synchrotron X-ray fluorescence (XRF) with spatial transcriptomics (ST) in mouse hippocampal tissue [48]. These early demonstrations underscore a growing interest in the integration of spatially resolved elemental and transcriptomic data.
Overview: spatial integration of spatial elemental imaging and spatial transcriptomics can reveal genes associated with metal bioaccumulation within specific tissue architectures, shedding light on metals-related pathways and cellular changes associated with tumorigenesis; BNEIR: biomedical national elemental imaging resource; TRACE: tissue region analysis through co-registration of elemental maps.
Through a multimodal analysis, we aim to demonstrate the potential for uncovering valuable insights into the interplay between genetic and elemental landscapes in cancer pathology, paving the way for a more comprehensive understanding of CRC progression. The primary objectives of this study are twofold: firstly, to identify correlations between gene signatures and metal abundance within specific cellular architectures and cell types in CRC; secondly, to leverage these insights to develop a metals-based pathway analysis. We believe this initial investigation will facilitate a more comprehensive exploration of metal-related pathways in cancer across larger cohorts, enhancing our understanding of pathogenesis, metastasis, and progression (Fig. 1). Ultimately, this research seeks to clarify the role of metal bioaccumulation in tumor dynamics, potentially leading to the discovery of novel biomarkers and the development of more effective therapeutic strategies.
Results
Results overview
We conducted a comprehensive ‘metals-based pathway analysis’ on a primary tumor section from a CRC patient, aiming to uncover associations between the abundance of various metals and gene pathways within distinct tissue architectures. The primary tumor, originating in the patient’s cecum at pathologic T-stage 3 (indicating invasion through the muscularis propria), advanced to stage 2a lymph node involvement and metastasized to the liver. This section provides a concise overview as context for the subsequent findings:
Spatial transcriptomics (ST) profiling: Utilized the 10x Genomics Visium Spatial Gene Expression (spatial transcriptomic—ST) assay to capture spatial variations in the expression of approximately 18 000 genes across 55-micron spots. This was complemented by high-resolution 40X H&E-stained whole slide imaging (WSI; Leica Aperio GT450) on the same section. Spatial elemental imaging (EI): A serial section was analyzed to profile all elements and their isotopes at 5-micron resolution using laser ablation inductively coupled plasma time-of-flight mass spectrometry (LA-ICP-TOF-MS). Spatial data integration: Achieved through TRACE (Tissue Region Analysis through Co-registration of Elemental Maps), which facilitated the spatial alignment of ST and EI data. Pathologist annotations: Annotated WSI identified regions inside, around, and away from the tumor, among other tissue architectures such as epithelium, serosa, and subcutaneous fat. Elemental hotspot analysis: Identifies hotspot areas of high and low metal abundance using Getis Ord* statistics [49]. This approach has previously been used in multimodal biomarker studies involving elemental imaging [50] and in this case it is being used to transform raw elemental maps into spatially contextualized features to identify regions of metal enrichment or depletion that may correlate with underlying biological pathways. Cell typing: Integrated single cell RNASeq data from a serial section to characterize cell types within these hotspots. Spatial gene expression correlations: Conducted a transcriptome-wide comparison of gene expression in areas with varying metal abundance. Pathway analysis: Gene set enrichment analysis performed on statistically significant genes to elucidate the connections between metal abundance and various biological processes. Metal-gene correlations were also visualized across genes contained within several select, relevant Cu homeostasis and tumor progression pathways for additional context, as an example of how spatial data integration can recapitulate and expand on known biological mechanisms and pathways. Factor/interaction analysis: Employed machine learning and clustering approaches to reveal distinct profiles of metals, genes, and cell types associated with different tissue histologies.
Identifying tissue architectural and cellular components associated with high elemental abundance
Our hotspot analysis revealed distinct elemental signatures associated with various histological structures (Fig. 2A-C, Figure S1, Tables S1-S2). Notably, the tumor regions were enriched in Cu, Mg, Fe, and Mn (higher Gi* statistic) as compared to other annotated tissue regions such as the muscularis propria. Zn concentrations were particularly elevated at the tumor—interface zone relative to tumor-only and serosal tissue, with even higher abundance observed in the muscularis propria. Additionally, the muscularis propria showed significant enrichments of metals such as K, Ca as compared to serosal tissue.
Metal distribution in tissue architectures: (A) Pathologist annotation of tumor, tumor interface, and surrounding musculature and serosa, (B) Boxplot demonstrating differences in Gi hotspot statistics across various tissue architectures, (C) Clustered heatmaps comparing average hotspot abundance (Gi*) by metal within select architectures and relative differences in hotspot abundance between architectures (positive z-score indicates higher metal abundance on architecture to left of inequality), (D) Demonstration of metal abundance mapped across slide for Fe and Mn (red indicates hotspot, blue indicates coldspot), juxtaposed with cellular abundance (red indicates higher abundance), (E) Dotplot demonstrating spearman correlation associations between cell type abundance and elemental distribution—larger red dots indicate positive associations of greater magnitude, whereas blue dots indicate negative associations, with size indicating magnitude.*
Deconvolution of ST into cell-type proportions at each Visium spot using Cell2Location revealed significant associations between various metals and specific cell types (Fig. 2D,E, Figure S1, Table S3). For instance, Cu was positively associated with presence of mast cells, B naive cells, endothelial cells, macrophages, proliferating B cells, T cells, and plasma cells, indicating a broad involvement across immune and vascular functions. Fe was associated with a mesenchymal phenotype, plasma cells, and B memory cells. Mg exhibited positive associations with B cell precursors and proliferating B cells. Mn was positively linked to dendritic cells. Zn and K both showed negative correlations with epithelial cells and red blood cells. Ca exhibited negative correlation with monocytes. In this section, we highlighted only a representative subset of significant associations, with the full set available in Table S3.
Metals-based pathway analysis
Our gene expression metal correlation analyses uncovered associations of metal accumulation with expression of genes from a wide range of biological pathways, some of which relate to cellular composition, immune responses, and tissue architecture (Tables S4-S7). A demonstration of the spatial covariation between elements and specific genes can be found in Fig. 3, Figure S1. This analysis highlighted both shared and unique roles of different metals in key cellular processes, including but not limited to:
Pathway analysis findings: (A) visual representation of ST data through leiden clustering, revealing distinct regions, (B) cu and zn gi hotspot statistics, (C) visualization of spatial distribution of IDO1 and DES gene expression demonstrating association with respective elements, (D) visualization of interferon gamma response and smooth muscle contraction through aggregating gene expression across pathway compared to a background distribution, again found to be associated with respective elements, (E) volcano plot mapping spearman correlation between 18 074 genes and gi* statistics for respective metals on x-axis and -log10(p-values) on y-axis—p-value and correlation cutoffs were set at 0.05/18 074; (F) pathways associated with respective elements (top includes most significant msigdb cancer hallmarks for cu abundance and bottom includes most significant reactome pathways for zn abundance).*
Immune Response, Inflammation, and Complement Activation. We identified metals which co-occurred with genes involved in key immune signaling pathways such as Interferon Gamma Response (Cu, p = 1.09e-12; Mg, p = 6.46e-18; Mn, p = 5.92e-18) and Allograft Rejection (Mn, p = 1.38e-29; Mg, p = 2.23e-22). Additionally, these metals were correlated with genes from Complement Activation and IL-2/STAT5 Signaling (Fe, p = 0.000175) pathways, indicating their significant roles in modulating immune responses within the TME. Cu was also prominently linked to genes involved in IL-6/JAK/STAT3 Signaling (p = 0.00505), a pathway known to be involved in inflammatory responses and immune regulation. Moreover, Mg was associated with genes in pathways like PD-1 Signaling (p = 1.47e-06) and Phosphorylation of CD3 and TCR Zeta Chains (p = 6.42e-05), further showing the correlations of these elements with immune cell activation and signaling.
Epithelial-Mesenchymal Transition (EMT), Cell Motility, and Tissue Remodeling. From this individual Colon resection, Fe, along with K and Zn, was associated with genes involved in EMT (Fe, p = 4.38e-21; K, p = 1.07e-11; Zn, p = 1.74e-07), cell motility, and Cell-extracellular Matrix Interactions (Fe, p = 9.41e-11; Zn, p = 2.06e-07). Zn abundance was linked to genes involved in Apical Junctions (p = 1.17e-06) and RHO GTPase (p = 5.73e-05) activation, both of which are crucial for cell adhesion, migration, and tissue remodeling.
Myogenesis and Muscle Contraction. Myogenesis was strongly associated with Zn (p = 1.1e-18) and K (p = 5.48e-15); smooth muscle contraction with Zn (p = 1.69e-17) and K (p = 2.36e-15); muscle contraction with Zn (p = 5.79e-18) and K (p = 1.82e-14); and cell—extracellular matrix interactions with Zn (p = 2.06e-07) and K (p = 1.57e-05).
Cell Signaling, Apoptosis, and DNA Repair. Genes related to cell signaling, apoptosis, and DNA repair were notably enriched for elements such as Cu, Mg, Mn, and Fe. For instance, Cu showed significant enrichment in the genes from G2-M Checkpoint (p = 1.30e-07) pathways. These elements also showed enrichment in apoptosis-related gene pathways, including Mg (p = 7.49e-03), Fe (p = 6.56e-03), and Mn (p = 1.48e-03).
Oxidative Stress, Cellular Repair, and Hormonal Regulation. Fe was associated with genes related to oxidative stress, cellular repair mechanisms, and hormonal responses. Fe abundance was also correlated with genes in the Hypoxia pathway (p = 0.000881) and Angiogenesis (p = 0.0104) pathways that reflect cellular adaptation to low oxygen levels, a hallmark of rapidly growing tumors. Additionally, Fe was strongly linked to genes involved in KRAS Signaling Up (p = 1.60e-16), a key oncogenic signaling pathway. From this individual specimen, these associations suggest further areas of inquiry to explore the multifaceted importance of Fe in tumor growth, stress responses, and repair mechanisms within the TME. Additionally, the relationship between element abundance and gene expression changes in the p53 pathway—Ca (p = 1.25e-08), K (p = 6.17e-04), Mg (p = 1.30e-02), Zn (p = 1.43e-04), and Mn (p = 2.28e-04)—may reflect alterations in hormonal signaling and cellular stress responses within the TME.
Metabolic Regulation and Signal Transduction. Genes from metabolic pathways, including glycolysis and cholesterol homeostasis, were frequently enriched in Zn and Cu, highlighting their roles in tumor metabolic adaptations. For example, Zn was significantly associated in genes related to Glycolysis (p = 7.09e-07), while, as another example, Cu showed enrichment in mTORC1 Signaling (p = 5.05e-03), emphasizing their contributions to the metabolic flexibility required for tumor survival within the TME.
Further examination of cu homeostasis and fe-related emt pathways
We further examined the relationship between specific genes and metal abundance within key pathways using PathVisio, overlaying correlations onto WikiPathway diagrams (Figures S2-S3) [51–56]. For genes correlating with Cu abundance, significant associations were observed within Cu homeostasis pathways. In particular, metal ion solute carrier (SLC) transporters MT1E (ρ=0.40, p < 0.001) and MT1G (ρ=0.40, p < 0.001) were strongly correlated with Cu levels. Additionally, ATOX1 (ρ=0.15, p < 0.001), a Cu chaperone delivering Cu + to P-type ATPases such as ATP7A (ρ=0.31, p < 0.001), exhibited notable correlations. Through its suppressive effects on SOD3, a key antioxidant gene, ATP7A may influence tumor progression [57–60]. SOD3 protects against oxidative stress and maintains cellular redox balance, and its downregulation has been associated with increased oxidative stress and induction of EMT, processes linked to tumor metastasis [61]. Correspondingly, SOD3 (ρ=-0.36, p < 0.001) showed the strongest negative correlation with Cu abundance within this pathway, suggesting that elevated Cu levels may drive downregulation of this protective gene through upregulation of ATP7A, further promoting cancer aggressiveness.
For Fe abundance, spatial correlations were identified with specific genes linked to EMT-related pathways. Among these, FN1 (ρ=0.34, p < 0.001), which encodes fibronectin, a protein involved in promoting cell motility through collagen matrix remodeling and serving as a marker for cancer-associated fibroblasts [62, 63], was most notable. Additional key correlations were observed with SPARC (ρ=0.36, p < 0.001), which phosphorylates focal adhesion kinase (FAK) to stimulate tumor cell invasion [64], and MMP9 (ρ=0.32, p < 0.001), a matrix metalloproteinase associated with extracellular matrix (ECM) degradation, lymph node metastasis, and poorer survival outcomes [65, 66]. By examining these specific gene correlations within their respective pathways, our analysis highlights the intricate interplay between metal abundance and gene activity in the tumor microenvironment.
Spatial clustering of metals, genes, cell-types
Our analysis thus far has suggested that metal concentrations are correlated with gene expression in key biological processes, including immune response, cell cycle regulation, and extracellular matrix (ECM) interactions. However, these associations raise important questions about how these processes are spatially organized within the tumor microenvironment. Spatial colocalization of metals and specific cellular activities is likely more pronounced in certain regions of the tissue, where distinct tissue architectures may drive localized biological effects. To explore these spatial patterns and their implications for tumor behavior, we conducted a spatial factor analysis to summarize these associations within distinct tissue architectures (Fig. 4).
Spatial Multimodal Factor Analysis Results: (A) Plotting of factor loadings from four of nine discovered spatial factors, (B) Clustermap demonstrating factor loadings averaged across architecture, (C) Relative differences in factor loadings between tissue architectures, hierarchically clustered, (D) Composition of factors, top five features per data type (genes, cell types, metals); contributions of these features were normalized to sum to one within each data type.
The spatial factor analysis using MEFISTO yielded nine factors corresponding to a variety of biological pathways (seeFigures S4-S5, Tables S8-S10) [67]. Factor 1 was primarily concentrated within the tumor and was associated with the accumulation of Mn, Mg and Cu, alongside pathways related to mucin O-glycosylation, glycolysis and estrogen response [68–70]. Factor 4 was linked to a mesenchymal phenotype and associated with Fe, showing strong connections to gene signatures tied to EMT. Factor 5 was localized at the tumor interface, associated with both epithelial and macrophage cells, and was enriched in pathways related to EMT and immune responses, including Dectin-2 and Defective GALNT12 causes CRCS1. Finally, Factor 6 was connected to myogenesis, muscle contraction, and monocyte presence, with a concentration of Zn at the tumor interface and in the muscularis.
How much of the spatial elemental distribution can be explained by transcriptomic, cellular, and pathway activity variation?
We used a multivariable predictive modeling approach with the MISTy package to evaluate how well transcriptomic, cellular, and pathway activity variations explain the spatial distribution of elements within the tumor microenvironment [71]. This approach employed spatially weighted random forest models to predict metal concentrations based on pathway activity scores, cell type abundances, and the spatial gene expression of selected genes at the same or neighboring spots. Our analysis found that a substantial proportion of the variation in elemental distribution can be attributed to these spatial transcriptomic data types. For elements such as K, Cu, Mn, Mg, Ca, and Fe, the models achieved R² values exceeding 75%, this multivariable analysis demonstrates that a gene/pathway/cell-type signature exists that is associated with this metal distribution (Figure S6,Table S11).
Among the predictors, spatial gene expression markers were the most significant influences upon elemental variation. In contrast, pathway activity scores and cell type abundances were less predictive, potentially due to the information loss during data aggregation. Interestingly, when incorporating other elemental concentrations as predictors, the majority of the variation in elemental distribution was accounted for by these other elements and cell types, rather than gene expression alone. A comprehensive list of gene, cell type, and pathway interactions with metals is provided in Figure S6 and Table S11.
Discussion
Targeting metal-dependent signaling and chaperoning within and around tumors has emerged as a promising strategy for inhibiting tumor growth and spread [5, 12, 72]. However, this approach is challenging due to the limited understanding of how metals accumulate within tumors. Developing a comprehensive map of the conditions and consequences of metal accumulation in tumors and their surrounding microenvironment is crucial for advancing next-generation cancer therapies. The differential bioaccumulation of endogenous metals, which may reflect variations in nutrient intake, storage, or disruptions in homeostasis, underscores the need for such a map. This metal map would provide valuable insights into the bidirectional molecular mechanisms governing metal accumulation, whether in excess or deficiency. By understanding how these metals contribute to tumor progression, we can better identify potential biomarkers and therapeutic targets, paving the way for more effective treatments.
Given evidence that lifestyle factors (such as dietary intake) and copper chelation can influence metal distributions within tumors—potentially impacting metabolic activity, inflammation, and tumorigenesis—we conducted this proof-of-concept study in colorectal cancer. Our aim was to establish an informatics workflow capable of spatially characterizing these effects when applied in future cohort studies. The computational workflow that was developed and implemented is the first of its kind and has uniquely layered on spatial transcriptomics atop elemental imaging to facilitate a metals-based pathway analysis. Our workflow offers a preliminary understanding of dynamic interplay between metallic and molecular alterations within the TME by establishing a catalogue of associations between metals, histologies, cells and genes that warrant further investigation.
By integrating histology imaging and pathology annotation, we observed in this sample metal bioaccumulation across various tumor microenvironment compartments, with Cu localizing intratumorally and Fe accumulating at the tumor’s proliferative front and within the stromal architecture. Cu, essential for mitochondrial energy metabolism in cancer cells, was primarily localized within the tumor, corroborating prior studies showing that Cu chelation can kill colon cancer cells by preventing redox cycling and thereby reducing the generation of reactive oxygen species [15, 17, 19, 20]. Fe, on the other hand, was found in the tumor stroma [73, 74]. This finding aligns with prior research suggesting that Fe accumulation in this region may result from residual Fe following intralesional hemorrhage [75, 75]. Larger tumors/polyps, often with delicate neovascular vessels, are prone to bleeding, which could contribute to this stromal Fe deposition, as demonstrated through supporting literature [75]. Indeed, deposits of Fe were found to be associated with hypoxia-related genes. These findings were further supported by spatial integration of single-cell data with ST to derive cell-type proportion estimates, which suggests an epithelial association for Cu and a mesenchymal/stromal phenotype for Fe. The presence of Fe in regions tied to stromal/collagen remodeling aligns with the epithelial-to-mesenchymal transition, a key process in tumor progression. Additionally, Cu was not only found within tumor areas but also co-localized with endothelial cells, plasma cells, and macrophages. Pathway analysis recapitulated Cu’s dual role: promoting CRC proliferation, as indicated by enrichment in Mitotic Cell Activity, and stimulating the antitumoral immune response through interferon signaling and immune recruitment via the surrounding vasculature and lymphatics, consistent with prior literature [76]. Both Cu and Fe findings were reinforced by spatial gene expression metal correlation analyses and visualization of spatial metal-gene correlations overlaid on pathways governing Cu homeostasis, metabolic reprogramming, and Fe-related EMT processes. These pathways highlight where their shared contributions relate to aggressive tumor phenotypes. Future studies are needed to validate these associations and disentangle the roles of Cu and Fe within specific cell mixtures localized in distinct tissue architectures. Together, these findings provide a nuanced understanding of Cu and Fe’s roles in tumor progression, consistent with prior literature while demonstrating the potential for new insights spurred through spatial integration and pathway-level analyses.
While Cu and Fe are key metals in cancer biology, other metals, such as Mn and Mg, also showed significant tumor concentrations and warrant further investigation for their distinct roles. Neither Mg nor Mn was associated with an epithelial phenotype, suggesting their involvement with other tumor-infiltrating cells. Previous research has linked increased cellular Mg to DNA and protein synthesis, as well as tissue growth, which aligns with our observation of Mg’s localization within the tumor and its association with the G2M cell cycle pathway [77]. Like Cu, Mg was also highly enriched for genes associated with inflammatory signaling via the Interferon/cytokine signaling pathway, among others. In contrast, Mn localized within the tumor likely due to its role in the antitumor immune response [78]. Prior studies have suggested Mn’s role in promoting dendritic cells, which is consistent with our findings showing the strongest correlation of Mn with dendritic cell presence within the tumor.
In our sample, zinc was localized at the tumor interface. Prior studies have linked zinc to alterations in cytoskeletal integrity, motility, and invasiveness in colon cancer cells, suggesting a potential role in chemoprevention. This is consistent with the observed activation of pathways related to myogenesis and muscle contraction, though its precise function within the tumor context remains unclear.
The presence of K and Ca at the tumor interface may reflect long-term accumulation in the colon’s muscularis propria, which consists of older, longer-lasting cells. This buildup is likely due to extended exposure to these elements. As a validation, we also found Ca ions within the same regions. Previous studies have shown that Ca affects intestinal smooth muscle contraction, partly by influencing its permeability—consistent with our findings of Ca within these architectures [79]. Additionally, both Ca and K were related to Ca²⁺-activated K⁺ channels, supporting this effect and validating our approach.
The integration of spatial multimodal modeling frameworks such as MEFISTO and MISTy offers a unique opportunity to jointly model metals, gene expression, and cell types in space within a multivariate context. This approach enables the identification of context-specific interactions that may serve as biomarkers or therapeutic targets. Importantly, such analyses can reveal molecular regulators that inform more balanced therapeutic strategies—for example, enhancing treatment efficacy while minimizing adverse effects like immune suppression. In particular, MISTy excels at detecting non-linear, non-parametric interactions across spatial neighborhoods. Its ability to predict the behavior of one feature (e.g. metal abundance) based on spatial proximity to others (e.g. immune cells or transcripts) supports the identification of spatially associated patterns that may generate hypotheses about underlying regulatory relationships. For example, MISTy revealed a spatial association between Cu and T cells, consistent with Cu’s known role in IL-2—mediated T cell activation. This aligns with prior studies showing that Cu chelation may impair immune function, underscoring the need to consider such metal—immune interactions in the design of targeted therapies [80, 81].
In addition to Cu—T cell interactions, MISTy uncovered several other associations of interest: 1) A spatial link between Ca and glutamine metabolism (GPT expression), aligning with prior work on the regulatory interplay between reduced glutaminolysis and enhanced Ca²⁺/NF-κB signaling [82]. 2) An association between K, TNFα activity, and monocyte abundance, which reflects earlier findings that K⁺ channel activity modulates cytokine production in activated macrophages [83, 84]. 3) A spatial relationship between Mn and PDCD1 (PD-1) expression, consistent with emerging evidence on the role of Mn in modulating immune responses and improving the efficacy of checkpoint immunotherapy [77]. These are just a few examples among many that demonstrate how highly integrative analytical frameworks can uncover biologically meaningful interactions for further follow up that may be difficult to detect using univariate or modality-specific approaches.
These associations underscore the unique potential enabled by integrating these advanced technologies. It’s important to highlight that a significant proportion of elemental distribution within tumors has been confirmed to have biomolecular underpinnings that govern not only metal deposition, redistribution, and chaperoning but also the downstream consequences of metal presence, such as inflammation. Developing mechanistic associations between these findings and tumor biology will take time, but the progress is promising.
The advantage of pathway analysis through whole transcriptomic profiling lies in the broad scope and range of pathways that can be explored, offering real potential for biomarker and intervention discovery. However, it is crucial to emphasize that while some findings in this manuscript are confirmatory, they are largely exploratory and require validation and expansion to a larger, unbiased cohort. Some of the pathways identified in this analysis reflect normal colonic function rather than tumorigenesis. We have been careful to limit the set of considered elements to avoid overstating conclusions. Future work will delve deeper into cell-type associations using complementary imaging techniques and will also investigate alterations that exceed those attributable solely to tissue architectural changes. Further work is needed to establish causal associations between metal distributions and specific tissue architectures, cell types, transcripts, and gene expression programs. While suggestive, these relationships are beyond the scope of this study. For example, metal bioaccumulation observed at the leading edge of tumors could reflect normal cellular processes rather than tumor-specific activity. In this work, we were not able to disentangle transcriptome-wide pathway correlations from the underlying cell types mediating them. Future studies will focus on cell-type-specific analyses to better resolve these associations and determine their biological relevance.
Key indicators of tumor progression and metastasis can only be fully understood when employing these technologies at scale, across an expanded cohort. CRC progression and tumorigenesis are heavily influenced by various patient- and tumor-specific factors, including tumor sidedness, genetic and lifestyle factors, invasiveness, grade, sex, and other confounders and modifiers such as mismatch repair deficiency, as seen in this patient, which can contribute to a T-cell exhausted phenotype. Scaling this analysis will also require addressing challenges such as batch effects and variability in tissue architecture across samples. The inclusion of biological and technical replicates, as well as normal adjacent tissue, will be critical for managing patient heterogeneity. Addressing these complexities will be essential to advance our understanding of metal biology in CRC and related treatments. Similar considerations are needed for further applications of this computational workflow to other tumor types.
Furthermore, tissue sections were separated by 5 microns, which assumes smooth tissue changes and may lead to potential imprecisions in co-registration, thereby impacting the findings. Profiling device noise also introduces additional challenges with regards to precision. Tissue was analyzed with paraffin intact which can lead to random signal attenuation due to build-up of paraffin within the LA-ICP-TOF-MS capture tube if the capture tube is too small. We did not deparaffinize because it has the potential to shift elemental distribution and reduce abundance. There are also biological buffers employed at various parts of tissue preparation that could influence findings—we were careful to remove elements that could have been influenced by preparation or overly represent individual variation. However, it is important to acknowledge that these sample preparation methods—while necessary given the practical challenges of obtaining flash-frozen human tissue—can alter the native distribution of metals [85–88]. For example, formalin, a commonly used fixative in FFPE preparation, is known to contain trace levels of metals such as Cu—contamination can occur when fixatives are applied in large volumes relative to tissue mass and because tissues contain numerous metal-binding sites. At the same time, fixation and embedding can also leach labile metals from the tissue. While this effect is minimal for Cu, which is largely protein-bound, it can be more significant for Fe and especially Zn, both of which have notable intracellular labile pools. In the future, increasing the size of the capture tube should significantly reduce impact of paraffin at the cost of resolution (i.e. from 1-micron to 5-10 micron resolution). Single-cell analysis was not feasible at this stage due to the 5-micron separation between the EI and ST sections, as well as the laser-destructive nature of the EI process. In our integrative analyses, metal abundance was aggregated at a 55-micron resolution, the same area covered by the Visium spot at the subsequent section—a level we deemed appropriate for combining elemental and transcriptomic data across adjacent tissue sections, while accounting for microarchitectural variability. Future work may further enhance the spatial resolution and improve integration across serial sections by more precisely characterizing tissue changes between closely spaced slices [89]. It should also be noted that due to the atomization of particles, LA-ICP-TOF-MS was not developed to characterize valence states. However, further understanding of speciation components that catalyze biomolecular interactions may be accomplished by pairing this technique with others, such as X-ray absorption near-edge structure (XANES) [90–93]. Analyzing smaller tissue regions at higher resolution may help deduce specific species, though performing these analyses at scale remains challenging.
This approach serves as a proof-of-concept workflow, demonstrating how the spatial integration of metals and gene analysis can soon be used to effectively capture the biological processes governing tumor metastasis, recurrence, and survival. This study paves the way for comprehensive exploration of spatial elemental data and gene expression in colorectal cancer and other tumor types, offering opportunities to advance therapeutic development, understand the biological and prognostic significance of elemental shifts, and investigate the impact of dietary intake on metal redistribution in early onset and progression of CRC.
Methods
Cohort curation and data collection
In our study, we focused on a specific patient, selected randomly from a cohort of 45 specimens recently profiled using Visium Spatial Transcriptomics (ST). The selected patient, a 55-60 year-old female, had a left colon microsatellite stable (MSS) tumor (intact MLH1, MSH2, MSH6, and PMS2 expression assessed through immunohistochemistry) that metastasized to the liver. The tissue sections were processed using the Sakura Tissue-Tek Prisma Stainer for hematoxylin and eosin staining. For ST profiling, 5 μm tissue segments were sectioned and dissected from formalin fixed paraffin embedded tissue blocks, placed within a 11 mm by 11 mm capture region. A separate unstained 10 μm serial section was cut for elemental imaging. Finally, a subsequent 5 μm section was left intact without macrodissection and was stained with H&E. The slides were scanned at 40X magnification (approximately 0.25 micron per pixel) using the Aperio-GT450 scanner (Leica, Wetzlar, Germany). The resulting hematoxylin and eosin-stained images were stored in SVS format with eight-bit color channels.
Elemental imaging
For the elemental profiling of the colorectal cancer tissue section, we used Laser Ablation Inductively Coupled Plasma Time-of-Flight Mass Spectrometry (LA-ICP-TOF-MS) [94]. This method represents a significant advancement in spatially resolved elemental imaging, offering both enhanced resolution and analytical speed. In this process, a pulsed laser is used to ablate minute portions of the tissue sample. The ablated material, now in particulate form, is then carried via a helium gas stream into the mass spectrometer. The key feature of LA-ICP-TOF-MS is its use of Time-of-Flight (TOF) technology, enabling rapid and comprehensive elemental detection across the entire periodic table. Operating at high frequencies (500-1000 Hz), the LA-ICP-TOF-MS system at the Biomedical National Elemental Imaging Resource (BNEIR) allows for detailed mapping of elemental distribution with ultra-high resolution, down to 1 µm. This capability is crucial for accurately capturing the complex elemental landscape within the tissue, providing insights into the spatial relationships and concentrations of various elements. By utilizing LA-ICP-TOF-MS, we were able to conduct an untargeted yet thorough profiling of the tissue section, yielding detailed data on its elemental composition. The sample was paraffin-embedded (including alcohol dehydration) prior to sectioning and profiling. Multi-point calibration was accomplished using a custom-made metal doped gelatin standard. Bovine heart tissue was also used as an internal standard. However, normalization to C12 was not performed in this experimental run due to the prior paraffin embedding. A pseudo-log transformation was applied to the elemental distribution data to reduce the impact of extraneous or extreme values. We also experimented with normalization using total summed metal concentration but ultimately decided against it, as it did not improve interpretability or consistency in this context.
Spatial transcriptomic profiling and spot-level cell-type deconvolution
We utilized the 10X Genomics Visium CytAssist spatial transcriptomics (ST) assay for in-depth profiling of a tissue section [95]. The tissue preparation and staining protocol included the following steps: (1) Mounting and Incubation: FFPE tissue sections were mounted on standard histology slides, dried at 42°C, and incubated at 60°C; (2) Deparaffinization and Staining: Slides underwent deparaffinization, rehydration, and H&E staining using the Sakura Tissue-Tek Prisma Stainer (Sakura Finetek USA, Torrance, CA). Coverslips were applied with a glycerol and xylene mounting medium; (3) Imaging: Whole-slide images (WSIs) were captured at 40x magnification, with a resolution of 0.25 μm/pixel, using Aperio GT450 scanners; and (4) Coverslip Removal: Coverslips were detached by immersing slides in xylene for 1–3 days followed by destaining and decrosslinking according to the 10x Genomics protocol (CG000520). Slides were then prepared for the Visium CytAssist workflow, including probe hybridization, ligation, probe transfer to Visium slides, and library preparation following the 10x Genomics protocol (CG000495). This includes an eosin staining step that enables alignment of the tissue with fiducials on the Visium slide using the CytAssist instrument, to coregister tissue morphology (H&E) with transcriptome data. Sequencing was conducted on an Illumina NovaSeq platform targeting a depth of 50,000 reads per spot. This facilitated unbiased, gridded spatial profiling of transcripts across the capture area. Spaceranger v1.0.0 software was used for alignment of Visium-ST sections with corresponding WSIs, quality control, and storage of Visium-ST data in AnnData format. Pathologists annotated the tumor’s interior, periphery/interface (tumor-interface—the region immediately adjacent to the tumor, transitioning into benign tissue), and surrounding benign architectures (serosa, muscularis propria) using the QuPath tool, permitting delineation of transcriptomic profiles in various tissue regions. Label propagation was used to refine unassigned annotations. To further characterize the tissue architecture, Visium data underwent dimensionality reduction via UMAP projection, which served as a precursor to graph-based Leiden clustering [96, 97]. Subsequent refinement of cluster assignments was conducted through label propagation based on the spatial coordinates, specifically targeting areas with high entropy to enhance spatial consistency of the cluster labels. Finally, the delineated clusters were superimposed onto the whole slide images, with labels assigned in accordance with the tissue architecture as judged by a pathologist.
To understand the cellular composition within the CRC tissue, we combined spatial transcriptomics with single-cell RNA-Seq data collected from a serial section. We utilized the Chromium Flex assay for single-cell profiling of disaggregated FFPE tissue sections from specific capture areas, employing the same transcriptomic probe set as the Visium assay. This approach revealed diverse cell types within the tissue, following the manufacturer’s Demonstrated Protocol (CG000784 RevB & CG000787 RevA). The generated data, processed using CellRanger v7.1.0, provided quality control metrics and cells-by-genes expression matrices for downstream analysis. For label transfer, cell types from a public single-cell RNA sequencing (scRNA-seq) dataset [98], specifically categorized cells from the Colon, were used. Cell types were grouped into broader categories. Dendritic Cells encompassed cDC1, cDC2, Lymphoid DC, and pDC cells. Epithelial Cells, which in this context are representative of tumor cells, also leveraged signatures defining Goblet cells, Colonocytes, and Enterocytes, and so forth. Endothelial Cells were grouped to cover a range of arterial, venous, and lymphatic subtypes. Macrophages were categorized based on LYVE1 + and MMP9 + subtypes, while Mast Cells included both Mast cells and CLC + Mast cells. Mesenchymal Cells, representing the tumor and normal fibrous stroma, included stromal cells, myofibroblasts, and pericytes. T Cells aggregated across CD4, CD8, and NK lineages. Plasma Cells included IgA and IgG subtypes. Additionally, Neuronal Cells, Monocytes, Neutrophils, Megakaryocytes, B Cell Precursors (including Immature B, Pro-B, Pre-B, and CLP cells), B Memory Cells, Proliferating B Cells and B Naive Cells, and Red Blood Cells were each treated as distinct categories. These cell type labels were transferred to our in-house CRC scRNA-seq dataset collected from a serial tissue section using SCVI (Single-Cell Variational Inference) framework [99, 100], which leverages a denoising variational autoencoder (VAE) trained to infer cell types. We employed the Cell2Location package for spot-level deconvolution, using the scRNA-seq data as a reference to estimate cell-type proportions/abundances in each spot [101]. This regression-based approach enabled us to spatially map the distribution of cell types within the CRC tissue section, yielding aggregate spot wise cellular abundances. It should be noted that we did not incorporate cellular morphology into our analysis; specifically, cell size was not included as a covariate. As a result, differences in the physical size or volume of specific cell types—which may influence transcript abundance—could introduce bias in cell-type abundance estimates. A morphological analysis at the scale necessary to estimate and adjust for cell-type-specific volumes within each spatial spot was beyond the scope of this study.
Single cell RNASeq and spatial transcriptomics specimen and data preprocessing
Prior to scRNASeq and ST profiling, RNA quality was assessed through capillary electrophoresis on a Fragment Analyzer instrument (Agilent).
After profiling, single-cell RNA-seq data were processed using the Scanpy framework. Low-quality cells and lowly expressed genes were filtered out, and mitochondrial gene content was used as a quality control metric. Cells with high mitochondrial content or extreme gene counts were excluded, as were mitochondrial genes. Gene expression counts were normalized to 10 000 unique molecular identifiers (UMIs) per cell and pseudo log-transformed. Highly variable genes were selected, technical effects (total counts and mitochondrial percentage) were regressed out, and data were scaled. Dimensionality reduction was performed using PCA, followed by neighborhood graph construction and UMAP embedding for visualization. Spatial transcriptomics data underwent a similar pipeline, adapted top remain compatible with downstream modeling (e.g. MISTy, MEFISTO). Mitochondrial genes were annotated and removed to minimize technical artifacts. Genes with low or sparse expression were filtered out. Data were normalized to 10 000 counts per spot and pseudo log-transformed. For spatial cell-type mapping, highly variable genes were selected, and from this set of genes, the top spatially variable genes were retained for spatial integration with single-cell data.
Quality control and co-registration via trace:
Recently, our team developed TRACE, a software tool under the Biomedical National Elemental Imaging Resource, specifically designed to co-register highly multiplexed elemental assays with tissue slides [41, 46, 47]. TRACE enables comprehensive spatial multimodal tissue analysis by integrating spatial elemental and transcriptomic data. In this colorectal cancer study, TRACE was instrumental in co-registering multi-channel elemental images with whole slide images (WSI). Initially, TRACE’s preprocessing workflow aggregated elemental abundance across channels and user defined thresholding through interactive segmentation of background regions to accurately detect tissue in the elemental maps. Refinement of tissue detection involved Gaussian filtering for reduction of outliers, which can impact tissue detection, and morphological operations (binary opening/closing) to further refined these images, focusing on removing noise and defining contiguous regions for analysis (Figure S7A).
A key challenge in preprocessing was addressing directional stripe artifacts in elemental imaging. To tackle this, we employed anisotropic diffusion filtering to reveal sharp edge patterns in the tissue followed by the probabilistic Hough transform which is a line fitting and detection for isolating detected strips [102, 103]. Line detection was followed by morphological transformations to remove the artifacts (Figure S7B-F). This meticulous process ensured elemental maps were free of distortions that could impact the analysis. We also encountered metal washout at tissue edges, particularly for elements like iron, and implemented a targeted erosion technique to address this issue. This strategy selectively eroded the edges of the tissue mask, effectively minimizing edge-related distortions and ensuring a more accurate representation of elemental distribution within the core tissue areas (Figure S7G).
For co-registration, a landmark-based approach with Homography matrix transformation aligned the elemental maps with the ST-associated WSI. Selecting 30 manual fiducials that marked structural similarities between the elemental maps and WSI enabled us to precisely overlay the elemental map onto the WSI. Recall that ST, single cell data and pathologist annotations had been mapped to these same H&E WSI, providing a common reference frame for integration with the elemental maps. TRACE exports a SpatialData file containing elemental abundance and tissue region annotations supplied and integrated using QuPath [104, 105]. Using the Nearest Neighbors algorithm (capturing adjacent pixels within radius of Visium spot), we assigned pixel coordinates from the elemental maps to the nearest Visium spots. Elemental pixel values were aggregated within spot, combining elemental and transcriptomic data onto a unified frame using SpatialData (interoperable with Anndata and Muon) data formats for in-depth analysis [106].
Hotspot analysis
We represented the spatial distribution of elemental metal concentrations within tissue sections by identifying areas of concentrated metal abundance. This was achieved by calculating hotspots using the Getis-Ord Gi* statistic [49], implemented through the pySAL package [107]. The Getis-Ord Gi* statistic, a spatial statistic, evaluates the metal concentration in each pixel or spot in relation to its neighbors (taken to be spots within approximately 100 micron, the center-to-center distance between adjacent spots), producing spot-level z-scores and p-values. These scores helped us discern statistically significant areas where metal concentrations were either unusually high (hot spots) or low (cold spots) compared to the expected local average. Our approach involved permutation testing and normality assumptions to ensure the robustness of identified hotspots. Edge effects (comparison of Visium spots at the tissue’s edge) and multiple comparison adjustments for spot-level z-scores were not considered nor required in this analysis. In general, the Getis-Ord Gi* statistic correlated strongly with raw elemental abundances, with Spearman correlation coefficients ranging from 0.93 to 0.98 across elements (e.g. Ca-44: 0.93, Cu-63: 0.97, K-41: 0.97, Mg-24: 0.96, Fe-56: 0.96, Zn-66: 0.98, Mn-55: 0.94). Given this strong concordance, we used Gi* z-scores throughout the remainder of the manuscript as our primary metric for representing elemental hotspotting and localized abundance. In general, given the selected length scale, the Gi* z-score mirrors local abundance. Comparisons of metals based on their hotspotting locations was not done in this work, therefore, normalization was not needed.
Association of hotspots with tissue histology and cell-type abundances
Linear regression models were used to associate the hotspot Gi* z-scores representing elemental abundance (dependent variable) with specific tissue architectures (four regions—cancer, serosa, interface, muscularis) represented as categorical fixed effects. Post-hoc pairwise comparisons between tissue architectures (e.g. tumor vs. interface) were conducted using estimated marginal means (R v4.1, emmeans package), which calculated mean Gi* statistics (hotspot concentrations) for each tissue type/architecture [108]. The mean statistics and their relative differences estimated through the linear modeling were hierarchically clustered to reveal similarities between elements in their distribution. Similarly, spatially-integrated cell-type proportions were associated with varying metal concentrations represented using Gi* statistics using spearman correlations. The spearman correlation matrix between metals and cell-types were hierarchically clustered to reveal metals with similar cell-type associations. Differences were visualized with boxplots, dotplots and clustered heatmaps with dendrograms.
Spatial gene expression correlations with metal abundance and pathway analysis
Spearman correlations were employed to compare spatial gene expression with elemental abundance, linking Gi* z-scores representing transformed elemental abundance as our elemental features with spatial transcriptomics across the entire transcriptome for each metal. Results were visualized using volcano plots [109]. After Bonferroni adjustment to account for multiple comparisons (alpha significance level of 0.05/18 074 for 18 074 genes tested), we selected the 150 top correlated genes for each element, ranked by their adjusted p-values (all 150 genes surpassed the Bonferroni adjustment for all metals). Separate sets of 150 genes were selected based on whether correlations were positive, negative or based on the magnitude of the correlation. For each metal, pathway analyses using the Enrichr package were conducted across various pathway databases (including MsigDB Cancer Hallmarks 2020, Reactome 2022) for genes associated with positive and negative elemental concentration changes, and separately for genes regardless of whether their test statistics were positive or negative [110]. Pathway analyses were only conducted for a select set of metals which did not warrant additional multiple comparison adjustment. Using PathVisio, metal gene correlations for Cu and Fe were also visualized atop genes involved in Copper Homeostasis and Epithelial-to-Mesenchymal related pathways, downloaded from WikiPathways [51–56].
Integrating spatial multimodal analysis to profile elemental, genetic, and cellular co-localization and their interactions
MEFISTO is a computational approach for analyzing multi-modal spatial biological data, unraveling latent factors that capture spatial variations linked to genes, metals, and cell types [67]. It leverages tensor factorization alongside spatial and hierarchical Gaussian Processes to handle spatial autocorrelation, uncovering profiles that include gene expressions, elemental concentrations, and cell-type distributions. To refine the model, we limited factors to at most 20 factors, employed spike-and-slab priors for sparsity, and set Gaussian likelihoods for each data type with pseudo-log transformation for elemental abundances and cell type abundances, and Gi* z-scores for EI. For efficient training, we used a fast convergence mode leveraging sparse Gaussian processes, discarded low-impact factors, and harnessed GPU acceleration. MEFISTO automatically prunes factors that contribute little to the variance of the data through variance thresholding and automatic relevance determination priors. The initial choice of 20 factors, which MEFISTO ultimately refined down to 9, was made as a cautious, data-driven strategy to allow the model flexibility while avoiding underfitting. Post-training, we examined the factors associated with specific tissue architectures using linear modeling on the spot-level factor loadings, similar to the above, and conducted a pathway analysis (Enrichr; MsigDB Cancer Hallmarks 2020, Reactome 2022) based on factor associated genes with an FDR adjusted p-value less than 0.05.
Unlike MEFISTO, which focuses on unraveling latent factors in multi-modal data, MISTy analyzes the spatial interactions among different features like genes, elements, and cell types. MISTy employs a multivariable model to determine how features including pathway activities, genes, and cell types relate to elemental abundance, using a spatially weighted approach [71]. Specifically, it assesses the predictiveness of these factors on elemental abundance, employing a squared exponential decay kernel to account for spatial proximity. Pathway activity was estimated using the Progeny database through the Decoupler package [111, 112]. Spatially-informed random forest models were fit to predict elemental abundance from the spatial transcriptomic data types. The ability of the ST datatypes to explain elemental variation throughout the slide was estimated through the calculation of an element-specific R² gain score. MISTy broke this score down by each of the ST datatypes (scRNASeq/cell proportions, ST, pathway activity), representing their relative contributions. Contributions from other elements were included in the model (other than the one being estimated)—a new R² gain score was estimated which demonstrated the relative contributions of ST data versus the mixture/associations with other elements. Predictive elements, genes, pathways and cell-types were provided using feature importance scores via the impurity-based Gini index. Further application of this tool could explore relationships between different modalities at the same location (para-view) and neighboring locations (juxta-view)—the complexity of this analysis is outside of the scope of this work and has been saved for follow up studies.
Additional information on elemental data preprocessing and analysis in the presence of spatial effects
Traditional preprocessing techniques used in elemental imaging (e.g. Gaussian blurring), which are conceptually different from Gi* statistics, were not applied, as we felt such blurring could obscure sharp spatial transitions and reduce interpretability. Aside from the factor analysis (MEFISTO) and multi-view modeling (MISTy), we did not adjust for spatial autocorrelation using methods such as Gaussian process regression with spatial exponential decay kernels or Bivariate Moran’s I. Instead, we prioritized more parsimonious, interpretable approaches (e.g. Spearman correlations), which are robust to non-linear relationships and better suited for exploratory spatial analyses, with prior precedent. A systematic comparison of spatial scales for Gi* calculation was considered out of scope, though we acknowledge that different spatial resolutions can influence hotspot detection. Each element’s spatial pattern was analyzed independently, and formal comparisons across elements were not conducted, as the number of adjusted transcript-level comparisons far exceeded the number of elements being evaluated.
Post-hoc exclusion of elements and isotopes
All detectable elements underwent spatial multimodal analysis. However, certain elements and select isotopes were removed post-hoc to ensure the reliability and biological relevance of the findings. Specifically, elements such as Na, Cd, Cr, V, As, Mo, Gd, Ru, Pt, Al, Ag, Se, Pb, Ba, Ni, Sb, Co, and Tl were excluded, despite their detection across the slide. These elements were removed to account for potential influences from tissue preparation processes, biological buffers, or individual variation, which could disproportionately impact their representation. Additionally, some of these elements, while detected, were not previously hypothesized to exist at significant levels in the colon. Although their presence here is informative and warrants future exploration, we chose to exclude them to avoid overstating conclusions or misinterpreting findings in the current study. This selective approach ensures the remaining set of elements aligns with the study’s focus on tumor-related elemental interactions and provides a foundation for more targeted analyses in future work.
Supplementary Material
mfaf034_Supplemental_Files
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Wong MC, Huang J, Lok V et al. Differences in incidence and mortality trends of colorectal cancer worldwide based on sex, age, and anatomic location. Clin Gastroenterol Hepatol. 2021;19:955–66. 10.1016/j.cgh.2020.02.02632088300 · doi ↗ · pubmed ↗
- 2Kasi PM, Shahjehan F, Cochuyt JJ et al. Rising proportion of young individuals with rectal and colon cancer. Clin Colorectal Cancer. 2019;18:e 87–95. 10.1016/j.clcc.2018.10.00230420120 · doi ↗ · pubmed ↗
- 3Patel SG, Ahnen DJ. Colorectal cancer in the young. Curr Gastroenterol Rep. 2018;20:1–12. 10.1007/s 11894-018-0618-929616330 · doi ↗ · pubmed ↗
- 4Coradduzza D, Congiargiu A, Azara E et al. Heavy metals in biological samples of cancer patients: a systematic literature review. Biometals. 2024;37:803–817. 10.1007/s 10534-024-00583-438347295 PMC 11254964 · doi ↗ · pubmed ↗
- 5Tang D, Kroemer G, Kang R. Targeting cuproplasia and cuproptosis in cancer. Nat Rev Clin Oncol. 2024;21:370–88. 10.1038/s 41571-024-00876-038486054 · doi ↗ · pubmed ↗
- 6Ge EJ, Bush AI, Casini A et al. Connecting copper and cancer: from transition metal signalling to metalloplasia. Nat Rev Cancer. 2022;22:102–13. 10.1038/s 41568-021-00417-234764459 PMC 8810673 · doi ↗ · pubmed ↗
- 7Villareal LB, Falcon DM, Xie L et al. Hypoxia-inducible factor 3α1 increases epithelial-to-mesenchymal transition and iron uptake to drive colorectal cancer liver metastasis. Br J Cancer. 2024;130:1–12.38693428 10.1038/s 41416-024-02699-3PMC 11183190 · doi ↗ · pubmed ↗
- 8Xue X, Taylor M, Anderson E et al. Hypoxia-inducible factor-2α activation promotes colorectal cancer progression by dysregulating iron homeostasis. Cancer Res. 2012;72:2285–93. 10.1158/0008-5472.CAN-11-383622419665 PMC 3342485 · doi ↗ · pubmed ↗
