Genomically determined subtypes and clinicopathological features as predictors of the efficacy of preoperative chemotherapy combined with HER2-targeted therapy for early-stage HER2-positive breast cancer
Sayumi Imamura, Kiyoshi Mori, Hiroyuki Yasojima, Makiko Mizutani, Kumiko Okada, Chie Hayashi, Masayuki Mano, Norikazu Masuda

TL;DR
This study shows that genomic subtypes and HER2 staining intensity can predict the effectiveness of preoperative chemotherapy in early-stage HER2-positive breast cancer.
Contribution
The study identifies HER2 intrinsic subtype and high HER2 staining intensity as independent predictors of pathological complete response in HER2-positive breast cancer patients.
Findings
HER2 intrinsic subtype was associated with significantly higher pathological complete response rates compared to non-HER2 subtypes.
High HER2 staining intensity was an independent predictor of achieving pathological complete response.
MP/BP subtyping showed varied 5-year distant disease-free survival rates across different intrinsic subtypes.
Abstract
Neoadjuvant chemotherapy (NAC) combined with human epidermal growth factor receptor 2 (HER2)-targeted agents is the mainstay of treatment of HER2-positive (HER2 +) breast cancer (BC). We investigated intrinsic subtypes and clinicopathological features as predictors of outcomes of NAC in HER2 + early-stage BC patients. Of 186 consecutive patients with early-stage HER2 + BC, as determined by immunohistochemistry (IHC) and fluorescence in situ hybridization, who received NAC plus HER2-targeted agents, 167 with an available biopsy specimen were eligible for inclusion. Intrinsic subtypes were determined by MammaPrint and Blueprint (MP/BP). Whole-slide images of HER2 IHC staining were used to evaluate intensity of HER2 staining. MP/BP subtype was determined for 124 patients: Luminal-A, 17 patients; Luminal-B, 23 patients; HER2, 76 patients; and Basal, 8 patients. Pathological complete…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBreast Cancer Treatment Studies · HER2/EGFR in Cancer Research · Advanced Breast Cancer Therapies
Introduction
Neoadjuvant chemotherapy (NAC) is widely accepted as one of the cornerstones of treatment for breast cancer [1]. The advantage of NAC is that the response to chemotherapy can be evaluated with surgical material, although studies to date have failed to show a clear difference in long-term outcomes between neoadjuvant and adjuvant settings [2]. Pathological complete response (pCR) has been adopted as a surrogate for long-term outcomes in studies of NAC [3]. A meta-analysis performed by Spring et al. showed a significant correlation between pCR and event-free survival or overall survival in NAC, particularly in human epidermal growth factor receptor 2-positive (HER2 +) and triple negative (TN) breast cancer [4]. Although poor surrogacy of pCR was also reported in another meta-analysis of NAC by Conforti et al., they found a strong correlation between pCR and long-term outcomes in early-stage breast cancer [3]. pCR is considered a reasonable surrogate for long-term outcomes of NAC in early-stage HER2 + or TN breast cancer.
In a meta-analysis by Houssami et al., the pCR rate by NAC was reported to be 8.3%, 18.7%, 38.9%, and 31.1% in estrogen receptor–positive (ER +)/HER2-negative (HER2–), ER +/HER2 +, ER-negative (ER–)/HER2 +, and TN breast cancer, respectively, showing a correlation between pCR rate and breast cancer subtype [5]. In recent decades, the pCR rate of NAC continues to increase due to two main driving forces: improved subtyping and the development of HER2-targeted agents. In particular, the development of trastuzumab and pertuzumab has marked a major turning point in HER2 + breast cancer treatment [6]. HER2 + breast cancer is known to be biologically heterogeneous, exhibits various chemosensitivities, and even de-intensification of treatments is suggested necessary for certain subtypes [7].
The MammaPrint^Ⓡ^ (MP) molecular test measures the expression of 70 genes by means of a tissue microarray, produces an index of genomic risk using a proprietary algorithm, and classifies breast cancer into high- and low-risk groups for distant disease [8]. The Blueprint^Ⓡ^ (BP) molecular subtyping test measures the expression of 80 signature genes to classify breast cancer into three subtypes: luminal, HER2, and basal [9]. Combining MP and BP (MP/BP), breast cancer can be divided into four molecular subgroups (MP/BP subtypes): Luminal-A (BP Luminal A for MP low risk), Luminal-B (BP Luminal B for MP high risk), HER2, and Basal.
The present study retrospectively analyzed the effect of NAC used in combination with HER2-targeted agents on each MP/BP subtype of HER2 + (determined by immunohistochemistry, IHC) early-stage breast cancer.
Methods
Patient selection
Eligible for the present study were patients who had been diagnosed with HER2 + breast cancer and had received neoadjuvant systemic therapy including HER2-targeted agents, followed by definitive surgery, at the National Hospital Organization Osaka National Hospital, Osaka, Japan, between May 2005 and July 2021, and for whom formalin-fixed, paraffin-embedded (FFPE) blocks of pretreatment biopsy or surgical specimens were available from the hospital archives.
Data collection and processing
Information, including age, tumor size, clinical stage, ER status, progesterone receptor (PgR) status, and HER2 staining rate, Ki-67 level, Bloom and Richardson grade, and HER2/chromosome enumeration probe 17 (CEP17) ratio in HER2 IHC 2 + cases, was collected from pathology reports and patient charts. ER and PgR status were considered positive if there was at least 1% positive staining by IHC. ER positivity between > 1% and < 10% was separately defined as ER weakly positive. HER2 status was judged positive if staining intensity was IHC 3 + or IHC 2 + with HER2 gene amplification by fluorescence in situ hybridization (FISH). pCR was defined as ypT0N0 or ypTisN0. Ki-67 level was evaluated, using a 20% cutoff.
Analysis of HER2 staining intensity and heterogeneity
This analysis was carried out using biopsy specimens obtained from 167 patients and stored at our hospital. After being subjected to HER2 immunohistochemical staining, slides prepared using these specimens were first examined under a light microscope before being digitized for image analysis. HER2 staining patterns were evaluated according to the intensity and heterogeneity of staining (Fig. 1). Staining intensity was quantified using ImageJ software (0 = white, 225 = black) and defined as High-intensity or Low-intensity based on its being, respectively, higher or lower than the cutoff value determined using the Youden index, which maximized the area under the receiver operating characteristic (ROC) curve. HER2 staining was classified as “heterogeneous” when areas of intense staining were interspersed with faint staining, or “homogeneous” when membrane staining was uniform.Fig. 1. Typical examples of immunohistochemistry (IHC) human epidermal growth factor receptor 2 (HER2) staining patterns. FISH, fluorescence in situ hybridization
Analysis of genomically determined subtypes
For MP/BP analysis (performed by Agendia Inc., Amsterdam, the Netherlands), 10 consecutive slides (5 μm each) were prepared from FFPE tissue blocks of biopsy specimens and surgical specimens. Biopsy specimens were submitted to MP/BP for all cases in which FFPE blocks were stored at our hospital. Surgical specimens were submitted if more than one-third of the tumor volume remained after NAC. RNA was extracted from thinly sliced samples, and the extracts were subjected to quality control prior to sequencing analysis. Samples deemed by quality control to be of insufficient RNA quality were excluded from the analysis.
The primary endpoints of this study are differences in pCR rates based on staining intensity and heterogeneity, as well as pCR rates by each intrinsic subtype. Secondary endpoints were defined as postoperative distant disease-free survival (D-DFS) rates for each intrinsic subtype. D-DFS was defined in accordance with the guidelines regarding clinical events to be included, namely death from any cause and distant recurrence [10].
Statistical analysis
All statistical analyses were performed using R version 4.2.0 (R Project for Statistical Computing, Vienna, Austria). Univariate analyses were performed using the chi-squared test or Fisher’s exact test for 2 × 2 tables, and the chi-squared test for n × 2 tables (n ≥ 3). Variables with p < 0.05 were entered into a multivariate logistic regression model for pCR, or a Cox proportional hazards model for D-DFS. Because of the small number of patients with each subtype, certain analyses, such as the comparison of survival outcomes, were performed for HER2 vs. non-HER2. The reported statistical significance levels were all two-sided, and p-values of less than 0.05 were considered statistically significant.
Results
Clinical features
MP/BP was successfully performed on biopsy specimens from 124 of the 160 patients who met the inclusion criteria for the present study (Fig. 2). Table 1 summarizes the characteristics of these 124 patients and the clinicopathological features of their tumors. The median age of the patients was 56.0 years, and 65% were aged ≥ 50 years. All had received taxane-containing NAC with trastuzumab, and additional anthracycline-containing chemotherapy had been administered in 67.0% of cases. The combination of trastuzumab and pertuzumab was used for only 24 patients (19.4%), since pertuzumab in neoadjuvant settings had become insurable in Japan only in October 2018.Fig. 2. Disposition of the patients included in the present study. Histopathological results for 23 patients were available for comparison of biopsy specimens with matched surgical samples, after 40 patients had been excluded for the following reasons: MammaPrint and Blueprint (MP/BP) molecular type tests had not been performed on the biopsy specimens (n = 9), results for biopsy specimens were not detected (n = 10), and results for surgical specimens were not detected (n = 21). pCR, pathological complete responseTable 1Characteristics of the patients for whom MammaPrint and Blueprint subtyping was successfully performed in the present study, and the clinicopathological features of their tumorsCharacteristicpCR(n = 51)non-pCR(n = 73)Univariate analysisMultivariate analysisOdds ratio95% CIp-valueOdds ratio95% CIp-valueAge, years ≤ 5016250.880.41–1.880.847^a^ > 503548cT T110190.690.29–1.650.518^a^ T2, T3, T44154cN N036530.910.41–2.000.814^a^ N1, N2, N31520ER Negative22124.031.87–8.68< 0.001^a^3.231.31–7.990.011 Weakly positive76 Positive2255HER2 IHC 3 + 49643.450.71–16.670.193^b^ IHC 2 + and FISH + 29Histological grading HG 340412.841.26–6.390.012^b^3.121.18–8.220.021 HG 1,21132Ki-67^c^ < 20%3120.360.10–0.380.109^b^ ≥ 20%3754 Unknown117Neoadjuvant chemotherapy Taxane + anthracycline32510.730.00–1.370.442^a^ Taxane only1922Anti-HER2 therapy Trastuzumab alone40600.790.32–1.930.648^a^ Trastuzumab + pertuzumab1113HER2 staining intensity High intensity40256.983.06–15.92< 0.001^a^4.121.55–10.900.004 Low intensity1148Intrinsic subtype^d^ Luminal-A01710.163.83–26.81< 0.001^a^4.471.46–13.700.009 Luminal-B221 HER24531 Basal44ER estrogen receptor, FISH fluorescence in situ hybridization*, HER2* human epidermal growth factor receptor 2, *HG *histological grade, IHC immunohistochemistry^a^Fisher’s exact test; p-values for each patient subgroup have been corrected using the Hochberg’s method^b^Chi-square test; p-values for each patient subgroup have been corrected using the Hochberg’s method^c^The chi-square test for Ki-67 was performed for patients excluding those with unknown Ki-67 status^d^Because of the small number of patients in each group, the intrinsic subtypes were divided into two groups (HER2 and others) and statistically examined
Reasons for the failure of MP/BP subtyping in the remaining 36 of the 160 biopsy specimens submitted were insufficient RNA (n = 23), insufficient tissue (n = 11), ductal carcinoma in situ only (n = 1), or severe inflammation (n = 1). Among the 124 cases for which MP/BP subtype was determined, the distribution of subtypes was as follows: Luminal-A, 17; Luminal-B, 23; HER2, 76; and Basal, 8. Of the 92 cases identified by IHC as ER +, more than half were non-Luminal (Luminal-A, 17 cases [18.5%]; Luminal-B, 23 cases [25.0%]; HER2, 50 cases [54.3%]; Basal, 2 cases [2.2%]). All 32 ER– cases were non-Luminal (HER2, 26 cases [81.2%]; Basal, 6 cases [18.8%]). All Basal cases, and all except 2 HER2 cases, were classified as MP high risk.
Treatment response according to the MP/BP system
Of the 124 patients whose biopsy specimens were successfully subtyped by MP/BP, 51 (41.1%) achieved pCR. pCR rate was low in those with the Luminal-A (0%) and Luminal-B (8.6%) subtypes, but high in those with the HER2 (59.2%) and Basal (50.0%) subtypes. Thus, pCR rate was higher in HER2 cases than in non-HER2 cases (59.2% vs. 12.5%; odds ratio, OR, 10.16 [95% confidence interval, CI, 3.58–26.81], p < 0.001), and higher in IHC ER– cases than in IHC ER + cases within the intrinsic HER2 subtype (73.0% vs. 50.0%; OR, 2.92 [95% CI, 1.06–8.09], p = 0.03). In ER-positive cases, BP classification correlated well with pCR rate; even in the IHC ER + cases, 50% of patients with BP HER2 achieved pCR, which was a significantly higher proportion than for patients with IHC ER + BP non-HER2 tumors (5.13%) (OR, 19.50 [95% CI, 4.11–92.50], p < 0.01).
HER2 staining pattern and treatment response
HER2 staining intensity was evaluated for 167 biopsy specimens with available whole-slide images. ROC analysis identified 149.50 as the optimal cutoff value. Mean intensity score was 172.6 in the High-intensity group (n = 87) and 111.9 in the Low-intensity group (n = 80). For the biopsy specimens from 124 patients for whom MP/BP results were available, 65 were of High-intensity and 59 were of Low-intensity. The pCR rates for the High-intensity and Low-intensity groups were 61.5% and 18.6%, respectively. High-intensity was strongly associated with the intrinsic HER2 subtype. When combining data for HER2 IHC intensity and intrinsic subtype, four groups were identified: High-intensity HER2 (pCR rate, 69.1%), High-intensity non-HER2 (pCR rate, 20.0%), Low-intensity HER2 (pCR rate, 33.3%), and Low-intensity non-HER2 (pCR rate, 10.5%) (Table 2).Table 2. Comparison of pCR rates by HER2 staining intensity and intrinsic subtypes in biopsy specimens from 124 patients with available MammaPrint and Blueprint resultsHER2 intensityIntrinsic subtype based on HER2 statusnpCR, n (%)npCR, n (%)High-intensity6540 (61.5)HER25538 (69.1)Others102 (20.0)Low-intensity5911 (18.6)HER2217 (33.3)Others384 (10.5)Total12451 (41.1)HER27645 (59.2)Others486 (12.5)HER2 human epidermal growth factor receptor 2, pCR pathological complete response
Heterogeneous HER2 staining was observed in 4 cases, all of which were classified as High-intensity. About half of the tumor remained after NAC in 2 cases, and about one-third of infiltrating cancer tissue remained after NAC in the other 2 cases.
Factors predicting pCR achievement
The results of univariate analyses showed negative or weakly positive ER status, high Bloom and Richardson histological grade, HER2 intrinsic subtype, and High-intensity of HER2 staining to be significantly correlated with achievement of pCR. Multivariate logistic regression analysis of data for factors identified as significant in univariate analysis (i.e. ER status, histological grade, HER2 staining intensity, and intrinsic subtype) showed all four to be independent predictors of pCR achievement (Table 1).
Prognosis
The median follow-up period was 73.0 months, ranging from 5.7 to 154.0 months. Overall, 13 (10.5%) of the 124 patients for whom MP/BP results were available experienced either local recurrence (2 patients, 1.6%) and/or distant disease (11 patients, 8.9%). In the group with distant recurrence, deaths due to breast cancer were recorded during the observation period, but no deaths due to causes other than breast cancer were confirmed. Kaplan–Meier estimates of 5-year D-DFS stratified by intrinsic subtype were 92.3% (95% CI, 0.789–1.000) for Luminal-A, 81.6% (0.647–1.000) for Luminal-B, 84.0% (95% CI, 0.747–0.944) for HER2, and 80.0% (95% CI, 0.516–1.000) for Basal (Fig. 3a). Cox proportional hazards analysis showed no significant differences between MP/BP subtypes. Kaplan–Meier curves for the pCR group and the non-pCR group according to intrinsic subtype are shown in Fig. 3b. No patients with the Luminal-A subtype achieved pCR. Regarding the other subtypes (Luminal-B, HER2, and Basal), there were no differences in prognosis between the pCR and non-pCR groups.Fig. 3. Kaplan–Meier estimates of 5-year distant disease-free survival (D-DFS) for the 124 patients included in the present study: **a **comparison of results for patients with the Luminal-A, Luminal-B, HER2, and basal subtypes, and of results for those in the non-HER2 vs. HER2 groups; **b **comparison of results for patients with each subtype (i.e. Luminal-A, Luminal-B, HER2, and Basal), based on whether they did or did not achieve pathological complete response (pCR) (i.e. pCR vs. non-pCR). CI, confidence interval; HER2, human epidermal growth factor receptor 2
Surgical samples with approximately one-third of all cancer cells remaining after NAC were eligible for MP/BP testing. However, for 24 of these, testing was not possible, due to insufficient RNA quantity; they were obtained mainly from tumors that were small originally (cT1–T2). Regarding the biopsy specimens, the intrinsic subtype was either unexamined or not determined in 19 cases. Having excluded these surgical samples and biopsy specimens, it was possible to compare in each of 23 cases the intrinsic subtype determined using a biopsy specimen with that determined using a surgical sample. The latter differed from the former in 10 of these 23 cases (Fig. 4).Fig. 4. Intrinsic subtype changes attributable to neoadjuvant chemotherapy: comparison of results for the biopsy specimens and matched surgical samples of 23 patients. The subtype changed for 10 patients. HER2, human epidermal growth factor receptor 2
Recurrence occurred in one of the 23 patients whose cancer had been subtyped using both a biopsy specimen and a surgical sample. For that patient, in both cases the subtype was Luminal-B.
Discussion
The present study showed that negative or weakly positive ER status, histological grade 3, MP/BP HER2 subtype, and High-intensity of HER2 IHC staining were independent factors significantly predicting achievement of pCR with NAC. It is noteworthy that the pCR rates were considerably different among MP/BP subtypes, even in IHC/FISH HER2 + breast cancer. These results are consistent with previous reports. Choi et al. showed that the pCR rate by NAC combined with trastuzumab was significantly higher in the cases with high HER2 gene amplification in FISH (> 10.0 HER2 gene copies per cell) than those of low amplification (6–10 HER2 gene copies per cell) (56% vs. 22%; p < 0.005) [11]. Another study also demonstrated a positive correlation between FISH amplification of HER2 and the pCR rate by NAC combined with trastuzumab [12]. Other previous studies revealed that high amplification or high mRNA expression of HER2 showed a positive correlation with response to trastuzumab in a subset of patients with IHC/FISH HER2 + breast cancer [13, 14]. However, in these studies, HER2 was semi-quantified by a single genomic test. In the present study, a multigene assay using MP/BP was performed, and hence subtyping was more thorough and comprehensive.
In the present study, we also evaluated whether there was heterogeneity in IHC staining in biopsy specimens, but only 4 out of 167 cases showed obvious heterogeneity. Of these 4 cases, 2 were pCR and 2 were non-pCR, but due to the small number of cases, we were unable to sufficiently evaluate the impact of IHC heterogeneity on treatment efficacy. Differences in treatment efficacy based on FISH copy number have been reported, and it has been suggested that cases with strong heterogeneity may not respond adequately to HER2-targeted agents [15]. However, there are few studies that evaluate HER2 heterogeneity using IHC alone and examine the efficacy of drug therapy, so it may be that IHC alone is insufficient for evaluating HER2 heterogeneity.
The pCR rates ranged from 0 in Luminal-A patients to 59.2% in HER2 patients, depending on intrinsic subtype. HER2 patients with IHC/FISH HER2 + breast cancer had significantly better results than non-HER2 patients, especially those with the BP luminal subtype, in response to NAC combined with HER2-targeted agents. These results were consistent with the findings from the PAMELA study, in which PAM50 was used as a multigene assay to define genomic subtypes so as to identify patients who are likely to benefit from dual HER2 blockade with lapatinib and trastuzumab as NAC without cytotoxic agents [16]. About half of IHC/FISH ER + HER2 + breast cancer patients were in the Luminal subgroup and had a significantly lower pCR rate but a better prognosis, even without having achieved pCR in response to NAC. Whereas NAC combined with HER2-targeted agents is considered suitable treatment for cases of HER2 breast cancer, it appears that NAC regimens should be reconsidered for the treatment of non-HER2 cases of IHC/FISH HER2 + breast cancer. NAC combined with HER2-targeted agents might be an overtreatment for Luminal cases. Indeed, based on the results of the KATHERINE study [17], it is common practice for trastuzumab emtansine (T-DM1) to be administered to prevent distant recurrence in IHC HER2-positive patients who have failed to achieve pCR with NAC. However, our study suggests that patients with intrinsic Luminal tumors are less likely to relapse even if pCR was not achieved, so administration of additional potent anti-cancer might be excessive for them. In the present study, the pCR rate was slightly high in patients who received NAC in combination with trastuzumab and pertuzumab, compared with those who received NAC with trastuzumab alone. Although this finding is not based on a strict comparison, similar results have been reported in previous global clinical trials such as the NeoSphere trial [18], and the TRYPHAENA trial [19].
In the HERA study, FISH amplification and HER2 staining intensity were positively correlated [20]. We also found a good correlation between High-intensity and HER2 status, together with a highly significant positive correlation between HER2 staining intensity and pCR. These results suggest that a high intensity of HER2 IHC staining can be an alternative to HER2 in predicting achievement of pCR when genomic analysis is unavailable. In the absence of homogeneous membranous staining, attention should be paid, as the therapeutic effect might be limited even with high staining intensity.
Given that in the present study the observation period was relatively short (73.0 months), the number of patients who experienced postoperative recurrence was low (local recurrence, 1.6%; distant recurrence, 8.9%). Among patients who experienced distant recurrence, there were cases of death due to breast cancer, but no cases of death due to causes other than breast cancer were observed. The 5-year D-DFS rate was 92.3% for patients with the Luminal-A subtype, 81.6% for those with the Luminal-B subtype, 84.0% for those with the HER2 subtype, and 80.0% for those with the Basal subtype. No significant differences in prognosis were observed among the intrinsic subtypes. Similarly, no differences in prognosis were observed when tumors were classified as HER2 or non-HER2. Although differences in pCR rates were observed for each intrinsic subtype, the lack of differences in distant recurrence was considered due to the relatively short observation period and the small number of recurrence cases. In addition, in Japan, T-DM1 was approved only from August 2020, for the purposes of health insurance coverage, as adjuvant therapy for early-stage HER2-positive breast cancer patients who have not achieved pCR. In the present study, only 6 of 73 patients in the non-pCR group received T-DM1, and we consider it important to note this difference from current clinical practice.
In the present study, the results of MP/BP could not be obtained for 36 (22.5%) of all 160 biopsy specimens, mainly because of insufficient RNA quality or quantity. The quality of RNA was insufficient in 5.8% of 86 samples collected between 2012 and 2021, compared with 24.3% of the 74 samples collected between 2005 and 2011. The difference was statistically significant (p < 0.001). Tumor tissue older than 10-years might be unsuitable for analysis using MP/BP. The key limitations of this study are that it is retrospective study, with a limited number of patients, and the chemotherapy regimens and HER2-targeted therapy varied with patient background and the era. Moreover, it is well known that tumors of the Luminal-A type tend to recur late, often more than 5 years after surgery. Considering the relatively short observation period in the present study, the prognosis for Luminal-A type tumors has not been sufficiently evaluated.
Conclusions
Our study highlights highly significant correlations between HER2 staining intensity and intrinsic subtypes, and both were significant predictors of therapeutic effects of NAC combined with HER-targeted agents. The findings indicate the essential role of MP/BP subtyping in the treatment of HER2 + breast cancer. Although intrinsic HER2 showed a higher pCR rate, about half of IHC ER + HER2 + breast cancers were of the Luminal subtype, with a better prognosis. Further analysis is needed on the treatment strategies for intrinsic Luminal-subtype breast cancers with positive HER2 IHC staining.
