Application of a preclinical 18-gene classifier to patients with locally advanced HNSCC
Steffen Löck, Lydia Koi, Kristin Gurtner, Fabian Lohaus, Max Kemper, Dominik Haim, Inge Tinhofer, Goda Kalinauskaite, Martin Stuschke, Maximilian Fleischmann, Claus Rödel, Anca-Ligia Grosu, Jürgen Debus, Claus Belka, Stephanie Combs, Simon Boeke, Gustavo Baretton

TL;DR
A preclinical 18-gene classifier for head and neck cancer is shown to predict treatment outcomes in patients when applied to real-world data.
Contribution
The study validates a preclinical gene classifier in human patients and links it to EGFR and loco-regional control.
Findings
The 18-gene classifier correlates with EGFR expression in HNSCC patients.
The classifier stratifies patients for loco-regional control in two cohorts.
Similar gene expression patterns were observed between xenografts and patient data.
Abstract
•Translation of 18-gene signature from HNSCC xenografts to two HNSCC patient cohorts.•Significant relation of the classifier to EGFR expression and LRC.•Similarities were observed between xenograft and patient data. Translation of 18-gene signature from HNSCC xenografts to two HNSCC patient cohorts. Significant relation of the classifier to EGFR expression and LRC. Similarities were observed between xenograft and patient data. In a previous preclinical trial on human HNSCC xenografts, 18 genes were predictive for adding cetuximab to fractionated radiotherapy. Here, we apply the corresponding 18-gene classifier to two HNSCC patient cohorts treated with combined radiochemotherapy (without cetuximab). We show that the classifier is related to EGFR expression and stratifies patients for loco-regional control (LRC) in both cohorts.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiopharmaceutical Chemistry and Applications · Effects of Radiation Exposure · Cancer Immunotherapy and Biomarkers
Introduction
The epidermal growth factor receptor (EGFR) represents a target in cancer therapy, in particular for head and neck squamous cell carcinoma (HNSCC) [1], where EGFR overexpression is associated with poor prognosis [2]. For patients with locally advanced HNSCC, combined irradiation and inhibition of the EGFR with a chimeric monoclonal antibody (cetuximab) is an approved treatment alternative to concurrent, cisplatin-based radiochemotherapy (RCTx) [[3], [4], [5]]. However, no advantage was observed in combination with radiotherapy or RCTx in unselected patients [3,6]. This indicates that predictive biomarkers need to be identified in order to select patients with a potential benefit of combined radiotherapy with EGFR inhibitors.
In a previous study, we analysed the effect of fractionated irradiation and additional inhibition of the EGFR in a preclinical trial using ten different human squamous cell carcinoma xenografts of the head and neck (HNSCC) [7]. Six out of 10 investigated tumour models showed a significant increase in local tumour control for the combined treatment with cetuximab and fractionated radiotherapy compared to irradiation alone. Out of the 6 responders, 3 showed EGFR gene amplification [7]. All of the responders to the combined therapy showed a significant increase in EGFR gene expression compared to non-responders. In addition, 18 genes were identified that showed a significant differential expression between responders and non-responders to the addition of cetuximab. Their expressions were anti-correlated to EGFR, i.e. a high expression was related to low EGFR gene expression and non-response. The genes were related to DNA repair, cell proliferation, epithelial-to-mesenchymal-transition (EMT) and hypoxia. It has been shown that these processes are also involved in response to cisplatin-based RCTx in unselected patient cohorts [[8], [9], [10], [11], [12]].
In this manuscript, we develop a binary classifier from the 18 genes that were differentially expressed between responders and non-responders to the addition of cetuximab to fractionated radiotherapy in the previous preclinical study [7]. Furthermore, we compare its characteristics between xenograft data (treated with radiotherapy and cetuximab) and two cohorts of unselected patients with HNSCC treated by primary or adjuvant cisplatin-based RCTx (without cetuximab). In particular, we study the relation of the classifier to EGFR expression and its prognostic value for loco-regional control (LRC).
Methods
Datasets
In this study, we re-use three previously reported datasets: (a) Sixty untreated xenografts of ten human HNSCC lines (Cal33, FaDu, SAS, SAT, UT-SCC-5, UT-SCC-8, UT-SCC-14, UT-SCC-15, UT-SCC-45, XF354; N = 6 for each) had been fixed in formalin and embedded in paraffine or were cryopreserved within previous local tumour control experiments [7,13]. Two patient datasets are considered, with the diagnosis of locally advanced disease: (b) Patients from the retrospective HNSCC cohort of the German Cancer Consortium − Radiation Oncology Group (DKTK-ROG) treated with primary RCTx (cisplatin- or mitomycin-C-based) between 2005 and 2011 in 6 different institutions (N = 158, median dose 72 Gy). Inclusion criteria, data collection, handling and analyses of biomaterial have been previously described [14]. (c) Patients from the retrospective HNSCC cohort of the DKTK-ROG who were treated with surgery followed by postoperative cisplatin-based RCTx (PORT-C) between 2004 and 2012 in 9 different institutions (N = 221, median dose 64 Gy). Details of this cohort have been described before [15,16].
Gene classifier
For gene expression analysis of xenografts, 10-mm frozen cross-sections of untreated tumours were used. Total RNA was extracted according to the manufacturer‘s instructions (Qiagen, RNeasy Mini Kit), and 80 ng total RNA was used per sample [7]. For both patient cohorts, formalin-fixed, paraffin-embedded (FFPE) tissues were prepared as described previously [14,16]. Gene expression analyses were performed by nCounter technology (nanoString Technologies, Seattle, WA) using an in-house radiobiological gene panel of 209 genes. The panel differed slightly for the adjuvant patient cohort as compared to the xenografts and the primary patient cohort [12]. Details on the procedure and data processing are given in [12]. Gene expressions were available for N = 138 and N = 196 patients in the primary and adjuvant cohort, respectively. The main characteristics of these patients are given in Table 1.Table 1. Patient characteristics of the cohorts treated with primary and adjuvant radiochemotherapy.Primary cohortAdjuvant cohortParameterMedianRangeMedianRangeAge (years)5939–825724–75Dose (Gy)7268.4–746456–68.4Tumour volume (cm^3^)274–176−−NumberPercentNumber****PercentSex (female/male)24/11417/8339/15720/80Localisation (oral cavity/oropharynx/ hypopharynx)23/71/4417/51/3256/113/2728/58/14T stage (1/2/3/4)0/17/38/830/12/28/6033/90/44/2917/46/22/15N stage (0/1/2/3)26/5/99/819/3/72/620/27/124/2510/14/63/13UICC stage (2/3/4)0/12/1260/9/917/31/1584/16/80R status (0/1/missing)−−112/83/157/42/1ECE status (0/1)−−89/10745/55p16 (negative/positive/missing)108/21/978/15/7120/72/461/37/2HPV16DNA (negative/positive/ missing)121/16/188/11/1130/65/166/33/1
Based on the nCounter data of the xenografts, 18 genes have previously been identified that were significantly differentially expressed between xenografts responding to the addition of cetuximab to fractionated radiotherapy and non-responders [7]. The genes were associated with DNA repair (BRCA1, XRCC4, XPC), cell proliferation (RIBC2, STAT5, CENPK, BIRC5, CDKN3), EMT (SNAI1, MME, TIMP, SLC3A2, FOSL1, ITGB1) and hypoxia (ALDOC, ANKRD37, BNIP3, LDHA). Here, we combined these genes to a binary 18-gene classifier, which classifies a xenograft or patient tumour as low or high expressed with respect to these 18 genes. After z-normalising the expressions of each gene, k-means clustering was performed with two cluster centres and Euclidean distance, which stratified the samples into two classes. This procedure was repeated for every cohort due to batch effects between the cohorts. We labelled the class in which the majority of the 18 genes has a higher expression as “high” and the other class as “low”. Cluster centres are reported in Supplementary Table 1. In addition, EGFR gene expression was available from the gene panel.
Statistical analysis
For the xenograft data, the irradiation dose required to control 50 % of the tumours (TCD50) after fractionated radiotherapy and after fractionated radiotherapy with addition of cetuximab was reported previously [7]. For patients, loco-regional control (LRC) was defined as a time-to-event endpoint, calculated from the start of RCTx until the event of a local or regional recurrence or censoring. Spearman correlation was used to analyse the correlation between gene expressions. To compare the expressions and the TCD50 between groups, the Mann-Whitney-U test was employed. For comparing two binary parameters, the chi-squared test was used. The association of continuous expressions on LRC was evaluated by Cox regression. The impact of the 18-gene classifier on LRC was analysed by Gray’s test, considering death as a competing risk [17]. All analyses were performed with SPSS Statistics version 29 (IBM Corporation, Armonk, NY) except for calculating the cumulative incidence function and Gray’s test, which was done in R statistics 4.4.1 (cmprsk package). p-values lower than 0.05 were considered as statistically significant.
Results
For the xenografts receiving combination treatment with cetuximab, most of the 18 genes were expressed in the same direction, i.e. showed a positive Spearman correlation (98.6 % of all significant pairwise correlation coefficients), see Fig. 1A. Fifteen out of 18 genes were negatively correlated with EGFR gene expression (significant correlation for 7 of 18). After k-means clustering, the 18-gene classifier showed a statistical trend for a high expression of the 18 clustered genes to be related to low EGFR (p = 0.081). The models Cal33, SAT, UT-SCC-15 and UT-SCC-8 were mainly classified to the low expression group of the 18-gene classifier and FaDu, SAS, UT-SCCC-14, UT-SCC-45, UT-SCC-5 and XF354 to the high expression group. While EGFR gene expression was positively associated with TCD50 after radiotherapy alone (p = 0.014), the 18-gene classifier was not related to that endpoint (p = 0.68) but to the TCD50 after combined radiotherapy and cetuximab treatment (p = 0.011), Fig. 1D. Here, the high-expression group showed higher TCD50, i.e. was more resistant to the combined irradiation/cetuximab treatment. Also, the low expression group of the classifier showed a larger reduction of the TCD50 after combined radiotherapy and cetuximab compared to the subgroup with high expression of the classifier (p < 0.001). In addition, the 18-gene signature was associated with EGFR gene amplification determined by FISH (p = 0.002) [7].Fig. 1. Spearman correlation between z-normalised gene expressions of genes contained in the developed classifier and EGFR expression for (A) the xenograft cohort, (B) the patient cohort treated with primary radiochemotherapy (RCTx) and (C) the patient cohort treated with adjuvant RCTx. In (D), the TCD50 after fractionated radiotherapy and cetuximab is compared between the xenograft groups defined by the 18-gene classifier. Patient stratification by the classifier for loco-regional control is shown in (E) for the primary patient cohort and (F) for the adjuvant cohort. The cumulative incidence function is shown accounting for the competing risk of death. p-values in (E) and (F) originate from Gray’s test.
For the patient cohort of the DKTK-ROG receiving primary RCTx (but without cetuximab), the expressions of STAT5 and XPC were mostly negatively correlated to the remaining genes (Fig. 1B). Eleven out of 18 individual expressions of the classifier were significantly associated with EGFR gene expression, most of them with a positive correlation (10 of 11). This positive correlation with EGFR translated to the 18-gene classifier (p = 0.005). While EGFR expression was not associated with LRC (p = 0.42), the 18-gene classifier showed a significant association (p = 0.022), with a lower LRC in the high-expression subgroup (Fig. 1E). From the clinical parameters reported in Table 1, a high expression of the classifier was significantly associated with p16 negativity, larger tumour volume, and lower age (Supplementary Table 2).
For the patient cohort of the DKTK-ROG receiving adjuvant RCTx, two genes were missing in the gene panel (ALDOC and BRCA1). The remaining 16 genes were used for determining the classifier. The expressions of CENPK, STAT5, RIBC2 and XPC were mostly negatively correlated to the remaining genes, similar to the primary cohort (Fig. 1C). Twelve out of 16 expressions were significantly associated with EGFR expression, most of them with a positive correlation (10 of 12). This positive correlation with EGFR translated to the gene classifier (p = 0.001). Higher EGFR expression was associated with lower LRC in Cox regression (p = 0.013). The classifier was significantly associated with LRC (p = 0.002), with the high-expression subgroup showing significantly more loco-regional recurrences (Fig. 1F). Several clinical parameters were related to the classifier. A high expression was significantly associated with tumour location in the oral cavity, high T stage, and p16 negativity. After correction for these related parameters in multivariable Cox regression, the classifier lost its significant association with LRC in both patient cohorts (primary RCTx: p = 0.12, adjuvant RCTx: p = 0.14).
Discussion
In this study, we applied an 18-gene classifier, based on genes that were identified as predictive for the addition of cetuximab to fractionated radiotherapy in HNSCC xenografts, for the stratification of two patient cohorts with locally advanced HNSCC treated with RCTx but without cetuximab. In xenografts, the 18 genes were mainly anti-correlated to EGFR gene expression with respect to response to the combined treatment including irradiation plus cetuximab, i.e. higher EGFR gene expression was associated with lower expression of the 18 gene classifier. Furthermore, higher expression of the 18-gene signature was significantly associated with higher TCD50. The prognostic value of the signature for TCD50 after combination of radiotherapy and cetuximab in xenografts translated to the univariable analysis of LRC for both patient cohorts who received combined RCTx. A high expression of the gene signature was related to lower LRC but also to patient and tumour characteristics such as oral cavity carcinoma, high T stage as well as p16 negativity. Due to these associations, the gene classifier lost its significant relation with LRC in multivariable analysis.
In the two patient cohorts, a high expression of the 18-gene classifier was significantly associated with p16-negative tumours and large tumour volumes. The classifier includes genes related to DNA repair (e.g. BRCA [18]), proliferation (e.g. STAT5 [19]), epithelial-mesenchymal-transition (EMT) (e.g. SNAI1 [20]) and hypoxia [9], which are often up-regulated in more aggressive and radioresistant tumours leading to poor prognosis (e.g. large tumours, oral cavity tumours), while HPV-positive HNSCC show higher response rates to radiotherapy due to different underlying biology. However, the genes may not exclusively play a role in radioresistance but could also be involved in resistance to concomitant systemic therapy with cisplatin and/ or EGFR-inhibitors such as cetuximab. For STAT5, it has been shown, that its activation plays a role in EMT, resistance to cisplatin-mediated apoptosis as well as EGFR-inhibition [21]. Also for the EMT-marker SNAI1, its contribution to the induction of an increased cisplatin-resistance [22] as well as to increased resistance to EGFR inhibition with erlotinib [23] has been previously described. Furthermore, a positive correlation between the classifier and EGFR expression was found in the two patient cohorts consisting of both HPV-positive and HPV-negative tumours. In addition, both the classifier and the EGFR gene expression were associated with decreased LRC in patients receiving adjuvant RCTx. After primary RCTx the classifier but not the EGFR gene expression (p = 0.42) was associated with LRC. In the literature, there are conflicting data regarding EGFR expression and radioresistance. In HPV-positive HNSCC models, EGFR-induced impairment of DNA repair and higher response rates to radiotherapy were demonstrated in vitro and in vivo [24]. In contrast, another study showed that inhibition of the EGFR does not lead to increased radiosensitisation on HPV-positive HNSCC [25]. Furthermore, clinical trials with “de-escalation strategies” that replace cisplatin by cetuximab have failed [[26], [27], [28]].
However, there are limitations of our study: The 18-gene classifier originates from a differential gene expression analysis between xenografts responding to additional cetuximab treatment and non-responders. In contrast, the patient cohorts considered in this manuscript did not receive treatment with cetuximab or another EGFR inhibitor, since cetuximab-treated patient cohorts were not available to us. Therefore, the predictive value of the 18 genes could not be assessed. Furthermore, predictive biomarkers for a specific intervention are not necessarily prognostic on unselected patients. Thus, the observed association of the 18-gene classifier with LRC on both patient cohorts may therefore be considered as coincidental or caused by the preselection of genes in the nanoString panel, which was composed to include known factors of radioresistance in HNSCC, such as hypoxia, EMT or DNA repair [[8], [9], [10], [11], [12]]. These factors are important for both, patients treated with adjuvant or primary RCTx. The impact of hypoxia in patients treated with postoperative RCTx may be surprising but suggests its impact also by other radiobiology-related mechanisms, such as increased stemness and tumour invasiveness, as discussed previously [16]. Nevertheless, the prognostic value of the classifier for the combined treatment approaches (combined radiotherapy and cetuximab or combined radiochemotherapy as primary or postoperative therapy) warrants further investigation. Validation in larger datasets is required, also to investigate the relevance of the classifier in combination with other tumour characteristics, such as T stage and p16 status.
Taken together, similarities were observed between xenografts and patient datasets: For both data types, increased EGFR gene expression was associated with radioresistance, i.e. to a higher TCD50 after fractionated radiotherapy in xenografts or to lower LRC for patients. In addition, a high expression of the 18-gene classifier was consistently related to therapy resistance, i.e. a higher TCD50 after combined radiotherapy and cetuximab or lower LRC after RCTx. Importantly, there was no significant association between the 18-gene classifier and the TCD50 after irradiation alone in xenografts. This indicates that the 18 gene classifier might be more suitable to predict the response to the combined treatment (irradiation with cetuximab or combined radiochemotherapy). Further analyses are required to verify this hypothesis.
In summary, this study not only evaluated the prognostic value of biomarker results obtained from HNSCC xenografts to unselected patient data, but also the potential of xenografts as valuable models for the development of biomarkers for individualized radiotherapy [29]. The 18-gene classifier may be further investigated for the stratification of patients with locally advanced HNSCC for combined treatment approaches.
CRediT authorship contribution statement
Steffen Löck: Conceptualization, Data curation, Formal analysis, Methodology, Resources, Writing – original draft, Writing – review & editing. Lydia Koi: Data curation, Methodology, Resources, Writing – review & editing. Kristin Gurtner: Data curation, Methodology, Resources, Writing – review & editing. Fabian Lohaus: Data curation, Methodology, Resources, Writing – review & editing. Max Kemper: Data curation, Methodology, Resources, Writing – review & editing. Dominik Haim: Data curation, Methodology, Resources, Writing – review & editing. Inge Tinhofer: Data curation, Resources, Writing – review & editing. Goda Kalinauskaite: Data curation, Resources, Writing – review & editing. Martin Stuschke: Data curation, Resources, Writing – review & editing. Maximilian Fleischmann: Data curation, Resources, Writing – review & editing. Claus Rödel: Data curation, Resources, Writing – review & editing. Anca-Ligia Grosu: Data curation, Resources, Writing – review & editing. Jürgen Debus: Data curation, Resources, Writing – review & editing. Claus Belka: Data curation, Resources, Writing – review & editing. Stephanie Combs: Data curation, Resources, Writing – review & editing. Simon Boeke: Data curation, Resources, Writing – review & editing. Gustavo Baretton: Data curation, Methodology, Resources, Writing – review & editing. Michael Baumann: Conceptualization, Data curation, Project administration, Resources, Supervision, Writing – review & editing. Mechthild Krause: Data curation, Conceptualization, Project administration, Resources, Supervision, Writing – review & editing. Annett Linge: Conceptualization, Data curation, Methodology, Resources, Supervision, Writing – original draft, Writing – review & editing.
Declaration of competing interest
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: In the past 5 years, Dr. Mechthild Krause received funding for her research projects by Merck KGaA (2018-2020 for clinical study). Dr. Mechthild Krause and Dr. Annett Linge were involved in a publicly funded (German Federal Ministry of Education and Research) project with the companies Medipan (2019-2022), Attomol GmbH (2019-2022), GA Generic Assays GmbH (2019-2022), Gesellschaft für medizinische und wissenschaftliche genetische Analysen (2019-2022), Lipotype GmbH (2019-2022) and PolyAn GmbH (2019-2022). Dr. Krause and Dr. Linge confirm that, to the best of their knowledge, none of the above-mentioned funding sources were involved in the preparation of this paper.
Michael Baumann, CEO and Scientific Chair of the German Cancer Research Center (DKFZ, Heidelberg) is responsible for collaborations with a large number of companies and institutions worldwide. In this capacity, he has signed contracts for research funding and/or collaborations, including commercial transfers, with industry and academia on behalf of his institute(s) and staff. He is a member of several supervisory boards, advisory boards and boards of trustees. Michael Baumann confirms that he has no conflict of interest with respect to this paper.
Dr. Simon Boeke received speakers’ honoraria and travel expenses by MERCK AG.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Salomon D.S.Brandt R.Ciardiello F.Normanno N.Epidermal growth factor-related peptides and their receptors in human malignancies Crit Rev Oncol Hematol 19199518323210.1016/1040-8428(94)00144-I 7612182 · doi ↗ · pubmed ↗
- 2Ang K.K.Berkey B.A.Tu X.Zhang H.-Z.Katz R.Hammond E.H.Impact of epidermal growth factor receptor expression on survival and pattern of relapse in patients with advanced head and neck carcinoma Cancer Res 6220027350735612499279 · pubmed ↗
- 3Bonner J.A.Harari P.M.Giralt J.Azarnia N.Shin D.M.Cohen R.B.Radiotherapy plus Cetuximab for Squamous-Cell Carcinoma of the Head and Neck N Engl J Med 354200656757810.1056/NEJ Moa 05342216467544 · doi ↗ · pubmed ↗
- 4Bonner J.A.Harari P.M.Giralt J.Cohen R.B.Jones C.U.Sur R.K.Radiotherapy plus cetuximab for locoregionally advanced head and neck cancer: 5-year survival data from a phase 3 randomised trial, and relation between cetuximab-induced rash and survival Lancet Oncol 112010212810.1016/S 1470-2045(09)70311-019897418 · doi ↗ · pubmed ↗
- 5Budach V.Stromberger C.Poettgen C.Baumann M.Budach W.Grabenbauer G.Hyperfractionated Accelerated Radiation Therapy (HART) of 70.6 Gy with concurrent 5-FU/Mitomycin C is Superior to HART of 77.6 Gy Alone in locally Advanced Head and Neck Cancer: long-term results of the ARO 95-06 Randomized phase III Trial. Int J Radiat Oncol 91201591692410.1016/j.ijrobp.2014.12.034 · doi ↗
- 6Ang K.K.Zhang Q.Rosenthal D.I.Nguyen-Tan P.F.Sherman E.J.Weber R.S.Randomized phase III Trial of concurrent Accelerated Radiation Plus Cisplatin with or without Cetuximab for Stage III to IV Head and Neck Carcinoma: RTOG 0522 J Clin Oncol 3220142940295010.1200/JCO.2013.53.563325154822 PMC 4162493 · doi ↗ · pubmed ↗
- 7Koi L.Löck S.Linge A.Thurow C.Hering S.Baumann M.EGFR-amplification plus gene expression profiling predicts response to combined radiotherapy with EGFR-inhibition: a preclinical trial in 10 HNSCC-tumour-xenograft models Radiother Oncol 124201749650310.1016/j.radonc.2017.07.00928807520 · doi ↗ · pubmed ↗
- 8Nordsmark M.Overgaard M.Overgaard J.Pretreatment oxygenation predicts radiation response in advanced squamous cell carcinoma of the head and neck Radiother Oncol 411996313910.1016/S 0167-8140(96)91811-38961365 · doi ↗ · pubmed ↗
