Very High vs. High Tumor Mutational Burden Across Tumors: Real-World Associations with MSI, Pathway Features, and Immunotherapy Outcomes
Maria Fernanda Teixeira, Victoria Tomaz, Lucas Campos Barbosa e Silva, Uelson Donizete, Francisco Tustumi, Helder Imoto Nakaya, Juliana Rodrigues Beal, Fernando Moura, Mitesh J. Borad, Paulo Vidal Campregher, Pedro Luiz Serrano Uson Junior

TL;DR
This study finds that very high tumor mutational burden is strongly linked to microsatellite instability but does not improve immunotherapy outcomes compared to high TMB.
Contribution
The paper introduces a real-world analysis of extreme TMB thresholds and their biological and clinical implications beyond standard classifications.
Findings
Very high TMB was strongly associated with microsatellite instability (MSI) compared to high TMB.
No significant differences in immunotherapy response or survival were observed between TMB-VH and TMB-H groups.
Pathway-level genomic features did not show statistically significant differences between TMB groups.
Abstract
Background: Tumor mutational burden (TMB) is an FDA-approved biomarker for immune checkpoint inhibitor (ICI) therapy. However, its predictive value varies among tumor types and molecular contexts. We investigated whether a very high TMB identifies a biologically distinct subset and whether a higher cutoff provides additional clinical insights beyond the conventional high TMB threshold. Methods: We analyzed 133 patients with advanced solid tumors and TMB ≥ 10 mutations/Mb (mut/Mb) who underwent tumor genomic profiling using a 523-gene DNA/RNA next-generation sequencing panel. Tumors were stratified into prespecified TMB categories: 10–20 mut/Mb (TMB-H) and >20 mut/Mb (TMB-VH). The clinical characteristics, ICI outcomes (in the treated subset), and pathway-level genomic features were compared between groups. Results: TMB-VH was observed in 42/133 (31.6%) patients and spanned more than 20…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer Immunotherapy and Biomarkers · Cancer Genomics and Diagnostics · CAR-T cell therapy research
1. Introduction
TMB, broadly defined as the number of somatic variants per coding area of the tumor genome, has emerged as a promising biomarker for predicting the response to immune checkpoint blockade in multiple cancer types [1]. The biological rationale is intuitive: a higher mutational load increases the likelihood of neoantigen formation, which may enhance tumor immunogenicity and increase visibility to the immune system [2]. Clinically, this association has been most consistent in malignancies such as melanoma, non-small cell lung cancer, and urothelial carcinoma, where a higher TMB has been linked to improved outcomes with immunotherapy. However, this relationship is not uniform across tumor types, and some cancers do not show a clear correlation between neoantigen load and CD8+ T-cell infiltration [3,4].
KEYNOTE-158, a multicohort single-arm phase II trial, reported a higher objective response rate with pembrolizumab in TMB-H tumors (≥10 mut/Mb by FoundationOne CDx) than in non-TMB-H tumors (29% vs. 6%) [5]. Although the non-randomized design limits definitive conclusions, these findings support the FDA’s tumor-agnostic approval of pembrolizumab for TMB-H solid tumors [6].
Despite its regulatory approval, the broad adoption of TMB as a tumor-agnostic biomarker remains challenging. Technical variability across platforms and thresholds, along with an incomplete understanding of the biological determinants of immune sensitivity, continue to limit consistent clinical interpretation [7,8]. The relationship between TMB and immune responsiveness is far from linear and appears to be highly context-dependent [9]. TMB-VH thresholds have been explored in previous pan-cancer studies. However, the distinctive contribution of the present study is a pathway-level view of tumor biology, focusing on metabolic and signaling programs that co-occur with extreme hypermutation.
There is a growing need to complement somatic variant counts with a more mechanistic understanding of how TMB aligns with tumor-intrinsic biology and the immune microenvironment. The mutation load may reflect broader genomic instability and coexisting features such as immune infiltration, antigen presentation capacity, and tumor metabolism. While metabolic reprogramming is increasingly recognized as a determinant of immunogenicity and immune evasion [10,11], the interplay between these programs and extreme TMB remains underexplored. Our primary objective was to compare immunotherapy outcomes and pathway-level genomic profiles between tumors with TMB-H and TMB-VH in a real-world cohort. The secondary objective was to describe the gene-level context associated with TMB-VH values.
2. Methods
2.1. Study Design and Population
This retrospective, single-center study was conducted at a tertiary cancer center. We screened all patients with advanced or metastatic tumors who underwent tumor genomic profiling using next-generation sequencing (NGS) at our institution between March 2017 and August 2023. Patients with TMB ≥ 10 mut/Mb were eligible for the molecular analyses. For clinical outcome analyses, we included a subset of patients with metastatic disease who received at least one dose of ICI and had complete clinical follow-up data available for survival analysis. Follow-up was conducted until January 2025.
2.2. Clinical Data Collection
Clinical variables (age, sex, self-reported race, primary diagnosis/histology, stage at diagnosis, prior therapies, and ICI regimen details) were abstracted from electronic medical records. Staging followed the American Joint Committee on Cancer (AJCC) 8th edition.
2.3. Genomic Profiling and Biomarker Definitions
Tumor profiling was performed using a targeted DNA- and RNA-based next-generation sequencing (NGS) panel covering 523 genes (approximately 1.28 Mb in length). Sequencing was performed on an Illumina platform using a clinical laboratory-validated workflow. Patients were stratified a priori into TMB categories for analysis: 10–20 mut/Mb (TMB-H) and >20 mut/Mb (TMB-VH). TMB and microsatellite instability (MSI) were derived from a hybrid capture tumor-only NGS assay using a laboratory-validated bioinformatic pipeline. Germline testing results were recorded when available but were not routinely performed for all patients.
2.3.1. TMB Calculation
TMB was reported as mut/Mb and calculated as the number of eligible somatic variants divided by the effective panel size. Eligible variants consisted of coding-region single-nucleotide variants (SNVs) and small insertions/deletions (indels), excluding multi-nucleotide variants (MNVs). Variants were required to meet the minimum technical thresholds (variant allele fraction ≥ 5% and coverage ≥ 50×) and pass laboratory quality filters.
Common germline polymorphisms were removed using population databases and additional post-database allele frequency/proximity filtering rules that were implemented in the software. Specifically, variants with an observed allele count ≥ 10 in any of the gnomAD exome, gnomAD genome, or 1000 Genomes were excluded, along with additional allele frequency/proximity rules, as described in the software documentation. Variants with a COSMIC count ≥ 50 were excluded from the numerator. Somatic status and clinical classification followed the institutional laboratory workflow aligned with AMP/ASCO/CAP standards. For TMB computation, variants were counted irrespective of the clinical tier (including variants of unknown significance), consistent with tumor-agnostic TMB definitions. The effective panel size denominator was defined as the total coding territory with coverage > 50×, after excluding low-confidence regions in which variants were not called.
2.3.2. MSI Assessment
MSI status was derived from the same tumor-only NGS data using the assay’s MSI module. This approach evaluates a set of microsatellite loci for instability relative to a baseline set of normal samples using information entropy metrics and generates a sample-level score reflecting the proportion of unstable sites among the assessed loci. The MSI classification threshold and baseline normal set were assay-configurable parameters established during laboratory validation and were not available in clinical reports. Therefore, MSI was analyzed as reported by the clinical laboratory (MSI-positive/MSI-H vs. MSS) without applying an investigator-defined cutoff.
2.4. Pathway Mapping
Somatic alterations (including pathogenic/likely pathogenic SNV/indel/CNV/fusion) were assigned to the predefined biological pathways (Figure 1). A pathway was considered “altered” if at least one gene in the pathway gene set harbored an eligible somatic alteration detected using the NGS platform.
2.5. Treatment Response Assessment
Response assessments were based on routine cross-sectional imaging performed in clinical practice, predominantly contrast-enhanced CT, with MRI used when clinically indicated. Imaging was performed per routine clinical practice, when available, every 8–12 weeks during ICI therapy; however, as a retrospective real-world cohort, the imaging schedules and assessment intervals were not uniform. Response was evaluated using RECIST v1.1 by institutional radiologists blinded to the molecular results.
2.6. Statistical Analysis
Continuous variables were assessed for distributional assumptions and summarized as mean (SD) when approximately normal or as median (IQR) when non-normal. Group comparisons used Welch’s t-test for normally distributed variables with unequal variances, Student’s t-test when variances were similar, and the Mann–Whitney U test for non-normally distributed variables. Categorical variables were compared using the χ^2^ or Fisher’s exact test, as appropriate. Exact two-sided p-values were reported.
Binary pathway alteration variables were compared between the TMB groups using two-sided Fisher’s exact tests because of sparse 2 × 2 tables. Odds ratios (ORs) with 95% confidence intervals and phi coefficients were reported as effect sizes for the analyses. Raw p-values are reported alongside the Benjamini–Hochberg false discovery rate (FDR)-adjusted q-values.
The primary endpoint was OS. Progression-free survival was not analyzed because of the limitations of retrospective capture, including heterogeneous imaging intervals and incomplete documentation of radiographic progression in routine practice. Analyses were restricted to ICI-treated patients with complete follow-up.
To mitigate the dilution of treatment-specific effects, OS was evaluated using two time origins: metastatic diagnosis and ICI initiation. Survival time was defined as the time from the relevant origin to death from any cause and was censored at the last follow-up. Kaplan–Meier curves were compared using the log-rank test. Cox proportional hazards models evaluated associations with prespecified covariates (age, clinical stage, MSI status, and TMB group); proportional hazards assumptions were assessed using Schoenfeld residuals. Results are reported as hazard ratios (HRs) with 95% confidence intervals.
As this was a retrospective cohort study with a fixed sample size, no a priori power calculations were performed. Analyses were hypothesis-generating and may have been underpowered for subgroup and pathway-level comparisons. Statistical analyses were performed using R 4.3.2 (R Foundation for Statistical Computing, Vienna, Austria). Statistical significance was defined as a two-sided p < 0.05. For pathway comparisons, FDR-adjusted q-values were used.
3. Results
From 1609 screened patients, 133 patients with advanced or metastatic tumors and TMB ≥ 10 mut/Mb underwent tumor genomic profiling using an extended DNA/RNA NGS platform. Patients were stratified into two prespecified categories based on TMB: 10–20 mut/Mb (TMB-H; n = 91) and >20 mut/Mb (TMB-VH; n = 42) (Table 1). The mean age at testing was 67 years, and 72/133 (54%) were male; age did not differ between the groups (Welch’s t-test p = 0.647). TMB values were right-skewed and are therefore presented descriptively; no hypothesis test was performed for TMB because it defined the comparison groups.
MSI was detected in 18/133 (14%) patients overall and was more prevalent in the TMB-VH group (38.1% vs. 2.2%; Fisher’s exact p = 8.9 × 10^−8^). The clinical stage at diagnosis, race, and diagnosis distributions were comparable between groups (Table 1). A detailed breakdown of the diagnoses is presented in Supplementary Table S1. The ICI regimens and response variables are listed in Supplementary Table S2.
To contextualize the extreme TMB at the gene level, we evaluated TMB distributions across recurrently altered genes (Figure 2). Several genes (e.g., GNAS, ROS1, and ALK) had a higher proportion of cases in the TMB-VH category, whereas others (e.g., KRAS and MET) were more frequently observed in the TMB-H category. Tumor suppressor alterations (e.g., TP53, RB1, and ATM) were observed in both strata. Figure 2 summarizes the dispersion of TMB values by gene context, including outliers in the hypermutated range.
Pathway-level comparisons did not identify statistically significant differences in alteration frequencies between the TMB groups after multiple testing corrections (all q > 0.05; Table 2). Several metabolic and signaling pathways showed numerically lower alteration frequencies in the TMB-VH group (including central carbon metabolism in cancer, C-type lectin receptor signaling, and JAK–STAT signaling) with small effect sizes (phi ≈ 0.06–0.18). Given the lack of significance after FDR correction, pathway-level results were reported descriptively (Table 2). Sensitivity analyses excluding MSI, POLE, and POLD1 somatic alterations yielded similar results. Additional pathway-level visualizations are shown in Figure 3 and Supplementary Figures S1–S4.
Among the 133 patients, 66 received at least one dose of ICI in the metastatic setting (Supplementary Table S2). Pembrolizumab was the most frequently used agent, followed by atezolizumab and ipilimumab/nivolumab. The objective response rate (complete or partial response) did not differ significantly between the TMB groups (Supplementary Table S2).
To address potential lead-time bias related to the pre-ICI disease course, OS was evaluated in the ICI-treated subset using two clinically relevant time points. When defined from the metastatic diagnosis, OS did not differ significantly between the TMB groups (log-rank p = 0.937; HR 1.03, 95% CI 0.52–2.01) (Figure 4A). The median OS was 37.3 months for TMB-VH and 22.6 months for TMB-H (Figure 4A). When OS was defined from ICI initiation, the results were consistent, with no significant difference observed between the groups (log-rank p = 0.814; HR 0.92, 95% CI 0.46–1.83); the median OS was 23.4 months for TMB-VH and 17.4 months for TMB-H (Figure 4B).
In the multivariable Cox regression, clinical stage was independently associated with worse OS, whereas age, MSI status, and TMB group were not independently associated with OS. TMB-VH showed a non-significant trend toward improved survival (HR < 1) (Supplementary Figure S4).
4. Discussion
In this study, we examined the clinical outcomes and biological correlations of TMB in patients with advanced solid tumors by comparing TMB 10–20 mut/Mb with TMB > 20 mut/Mb. Two findings were most consistent in our dataset: MSI was substantially enriched among tumors with TMB > 20, and ICI-treated patients did not demonstrate a statistically significant OS advantage in the very high TMB group. Although Kaplan–Meier curves suggested numerically longer OS for TMB > 20 across both time origins (from metastatic diagnosis and ICI initiation), these differences were not statistically significant, and multivariable modeling did not identify the TMB category as an independent predictor of survival after adjustment for age, MSI status, and clinical stage.
These findings support a cautious and context-aware interpretation of TMB when applied as a stand-alone biomarker in heterogeneous real-world populations. Although TMB has been associated with ICI benefits at the population level in selected settings, its performance remains inconsistent across histologies, treatment lines, and sequencing platforms [12]. Tumor-agnostic application of fixed TMB thresholds risks oversimplifying the biological heterogeneity that affects immunogenicity and therapeutic sensitivity, including differences in tumor microenvironment composition, mutational processes, and tumor-intrinsic programs that shape antigen presentation and immune evasion [13,14]. In practical terms, a numeric TMB threshold captures only one dimension of a coupled immunobiological system in which the neoantigen burden, immune contexture, and tumor-intrinsic states collectively determine whether antigenicity translates into effective antitumor immunity.
The principal novelty of this study is the integration of TMB stratification with a pathway-level functional lens, emphasizing metabolic and signaling programs rather than treating TMB as a quantitative surrogate for the neoantigen load. No pathway-level differences remained statistically significant after multiple testing corrections, underscoring that these analyses are hypothesis-generating rather than confirmatory. This is particularly relevant because pathways are not independent units; they overlap, share genes, and converge on common nodes. True effects may be distributed across interconnected networks and diluted when each pathway is tested separately.
Despite the absence of FDR-significant pathway differences, we observed descriptive patterns in which several metabolic and signaling pathways appeared to be altered less frequently among tumors with TMB > 20. One biologically plausible (but unproven) interpretation is that extreme hypermutation may arise within distinct tumor-intrinsic contexts and under specific evolutionary constraints. Hypermutation driven by mismatch repair deficiency, polymerase proofreading defects, or exogenous mutagens may increase variant counts while simultaneously reshaping the selective pressures on growth, stress adaptation, and metabolism. In this framework, a lower alteration frequency in specific pathways could reflect alternative routes to tumor fitness, negative selection against additional disruption in highly unstable genomes, or residual histology-associated differences within TMB-defined strata. These signals require validation in cohorts designed to explicitly model the pathway interactions.
We also observed histology-associated variations in TMB-VH prevalence, consistent with prior reports that cancers linked to mutagenic exposures (e.g., melanoma and NSCLC) more frequently harbor extreme hypermutation [15]. Gastrointestinal malignancies contributed fewer TMB-VH cases in our cohort; however, this study was not designed to estimate population incidence and reflects real-world testing patterns and referral practices at a tertiary center. Nevertheless, these patterns underscore why a single TMB cutoff may perform differently across tumor types and why tumor-aware calibration and context-specific interpretation are likely necessary for clinical use.
Gene-level visualization further supports the biological heterogeneity within TMB-H. Some recurrent alterations were associated with broader TMB ranges and occasional hypermutated outliers, whereas others clustered within narrower distributions. Although our analysis was descriptive and not powered for gene-by-gene inference, these patterns support the broader concept that “TMB-H” encompasses biologically heterogeneous states and that the processes driving hypermutation differ across tumors. Future work should test whether combining TMB magnitude with genomic context (e.g., DNA repair alterations, polymerase proofreading defects, and MSI signatures) improves prediction of ICI benefit beyond a single numeric threshold [7,16].
A key implication is that improved prediction will likely require integrative models that combine correlated dimensions (TMB magnitude, MSI status, gene-level context, and pathway/network states) while accommodating nonlinear interactions and histology-specific priors. Artificial intelligence (AI) approaches may be valuable in this regard as well. Machine learning methods can integrate multi-omics and pathway-level features, identify non-obvious associations, and model interactions that are difficult to pre-specify using conventional stratified analyses. In larger multicenter datasets, AI-enabled models could incorporate mutational signatures and DNA repair states, pathway/network embeddings, and clinical features to generate more individualized predictions of ICI benefit and clarify whether and when TMB-VH adds predictive value beyond MSI and other covariates.
This study has limitations. First, the retrospective single-institution design introduces selection bias and limits generalizability. Second, given the size of the ICI-treated subset (n = 66; TMB-VH, n = 26), the study was likely underpowered to detect modest hazard ratios, increasing uncertainty around null findings, particularly for pathway-level comparisons. Third, our pathway framework simplifies interdependent biological networks and may miss context-dependent interactions. Fourth, germline testing was not uniformly available, limiting the assessment of hereditary predisposition and its contribution to the hypermutation. Fifth, OS follow-up was relatively short for a subset of patients; accrual was extended through August 2023, and follow-up was censored in January 2025, leaving several patients with approximately 12–18 months of survival observation, limiting the precision of long-term estimates. Finally, treatment exposure and line of therapy were heterogeneous, and residual confounding was likely, despite multivariable adjustment. Progression-free survival was not analyzed because the imaging intervals and progression ascertainment were heterogeneous in this retrospective cohort; therefore, OS was prioritized as the most consistently captured time-to-event endpoint.
Despite these constraints, the contribution of this study is not to establish a definitive predictive utility for extreme TMB thresholds but to provide a real-world, pathway-oriented framework that generates testable hypotheses regarding the biological context of hypermutation. The inclusion of a histologically diverse cohort spanning more than 20 tumor types and multiple ICI regimens enhances the relevance of this study to contemporary precision oncology. Larger multicenter cohorts with standardized sequencing, harmonized TMB definitions, and more uniform treatment annotations are required to validate these descriptive patterns and clarify their clinical utility. Future studies should evaluate integrative biomarker models incorporating TMB alongside MSI status, gene-level context, and pathway-level features to refine ICI selection [11] and test whether metabolic and signaling programs linked to hypermutation can inform rational combination strategies in appropriately designed prospective studies.
5. Conclusions
In this single-center, hypothesis-generating, real-world cohort study, TMB-VH captured biologically heterogeneous states and was strongly enriched for MSI. Although no pathway-level comparison remained significant after FDR correction, the descriptive patterns in metabolic and signaling pathways motivate further investigation of the functional context alongside TMB. Future multicenter studies should validate these findings and test integrative biomarker frameworks to better predict the benefits of immune checkpoint inhibitors.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Yarchoan M. Hopkins A. Jaffee E.M. Tumor Mutational Burden and Response Rate to PD-1 Inhibition N. Engl. J. Med.20173772500250110.1056/NEJ Mc 171344429262275 PMC 6549688 · doi ↗ · pubmed ↗
- 2Goodman A.M. Kato S. Bazhenova L. Patel S.P. Frampton G.M. Miller V. Stephens P.J. Daniels G.A. Kurzrock R. Tumor Mutational Burden as an Independent Predictor of Response to Immunotherapy in Diverse Cancers Mol. Cancer Ther.2017162598260810.1158/1535-7163.MCT-17-038628835386 PMC 5670009 · doi ↗ · pubmed ↗
- 3Chalmers Z.R. Connelly C.F. Fabrizio D. Gay L. Ali S.M. Ennis R. Schrock A. Campbell B. Shlien A. Chmielecki J. Analysis of 100,000 human cancer genomes reveals the landscape of tumor mutational burden Genome Med.201793410.1186/s 13073-017-0424-228420421 PMC 5395719 · doi ↗ · pubmed ↗
- 4Mc Grail D.J. PiliéP.G. Rashid N.U. Voorwerk L. Slagter M. Kok M. Jonasch E. Khasraw M. Heimberger A. Lim B. High tumor mutation burden fails to predict immune checkpoint blockade response across all cancer types Ann. Oncol.20213266167210.1016/j.annonc.2021.02.00633736924 PMC 8053682 · doi ↗ · pubmed ↗
- 5Marabelle A. Fakih M. Lopez J. Shah M. Shapira-Frommer R. Nakagawa K. Chung H.C. Kindler H.L. Lopez-Martin J.A. Miller W.H.Jr. Association of tumour mutational burden with outcomes in patients with advanced solid tumours treated with pembrolizumab: Prospective biomarker analysis of the multicohort, open-label, phase 2 KEYNOTE-158 study Lancet Oncol.2020211353136510.1016/S 1470-2045(20)30445-932919526 · doi ↗ · pubmed ↗
- 6Marcus L. Fashoyin-Aje L.A. Donoghue M. Yuan M. Rodriguez L. Gallagher P.S. Philip R. Ghosh S. Theoret M.R. Beaver J.A. FDA Approval Summary: Pembrolizumab for the Treatment of Tumor Mutational Burden–High Solid Tumors Clin. Cancer Res.2021274685468910.1158/1078-0432.CCR-21-032734083238 PMC 8416776 · doi ↗ · pubmed ↗
- 7Leone R.D. Powell J.D. Metabolism of immune cells in cancer Nat. Rev. Cancer 20202051653110.1038/s 41568-020-0273-y 32632251 PMC 8041116 · doi ↗ · pubmed ↗
- 8Anderson N.M. Simon M.C. The tumor microenvironment Curr. Biol.202030 R 921R 92510.1016/j.cub.2020.06.08132810447 PMC 8194051 · doi ↗ · pubmed ↗
