PET/CT and MR Improve Interobserver Agreement in Primary Tumor Determination for Radiotherapy in Esophageal Squamous Cell Cancer
Ajra Secerov-Ermenc, Primoz Peterlin, Vaneja Velenik, Ana Jeromen-Peressutti, Jasna But-Hadzic, Franc Anderluh, Barbara Segedin

TL;DR
Using PET/CT and MR imaging improves agreement among doctors when defining the primary tumor for radiotherapy in esophageal cancer.
Contribution
This study shows that PET/CT and fused MR imaging increase interobserver agreement in tumor delineation for ESCC radiotherapy.
Findings
PET/CT with fused MR had the highest conformity index (0.71) for tumor agreement.
PET/CT alone improved agreement compared to CT alone (p = 0.003).
Fused MR with PET/CT outperformed PET/CT alone (p = 0.002).
Abstract
Background/Objectives: The aim of the study was to evaluate interobserver variability in the determination of the primary tumor for radiotherapy treatment planning in esophageal squamous cell carcinoma (ESCC). Methods: Sixteen patients with locally advanced ESCC were included in the analysis. In all patients positron emission tomography with computed tomography (PETC/CT) and magnetic resonance (MR) scans for radiotherapy planning were performed. Five experienced radiation oncologists delineated the primary tumor based on CT alone, MR alone, PET/CT, CT with fused MR and PET/CT with fused MR. Mean tumor volumes were calculated for each patient and imaging modality. The generalized conformity index (CIgen) was calculated to assess agreement in tumor determination. Results: The mean tumor volumes and CIgen for CT alone, MR alone, PET/CT, CT with fused MR and PET/CT with fused MR were 33.1…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEsophageal Cancer Research and Treatment · Lung Cancer Diagnosis and Treatment · Gastric Cancer Management and Outcomes
1. Introduction
Imaging is essential for the diagnosis, staging, treatment planning, and post-treatment follow-up of esophageal squamous cell carcinoma (ESCC). Endoscopic ultrasound (EUS) is generally effective in evaluation of the clinical T stage of ESCC, though caution is needed when differentiating between Tis, T1a, T1b, and in T4 stages [1]. Computed tomography (CT) is commonly utilized for staging of ESCC; however, its accuracy in determining T stage remains limited [2].
Positron emission tomography (PET) combined with computed tomography (CT) is now established as a crucial diagnostic tool for the initial staging of ESCC. This is due to its high accuracy in detecting metastases, including pathological lymph nodes, with a sensitivity of 66% and a specificity of 96% [3]. The tracer most commonly used in ESCC diagnostics is 18-F-fluorodeoxyglucose (FDG), a glucose analog that accumulates in tissues with increased metabolic activity [4,5]. Another important application of FDG-PET/CT in the diagnosis of ESCC is the determination of the extent of the primary tumor. Studies have shown that there is a strong correlation between the tumor length measured on preoperative FDG-PET/CT scans and the actual dimensions in histopathological samples obtained after surgery [6,7]. However, several studies investigating interobserver variability in determining the primary tumor in radiotherapy have shown contradictory results. While some studies suggest potential benefits, others have shown no significant impact of FDG-PET/CT on improving interobserver agreement [8,9].
Magnetic resonance (MR) imaging is promising in the diagnostics of ESCC as it offers better resolution and soft tissue contrast compared to CT imaging. However, MR imaging of areas such as the upper abdomen or mediastinum is challenging due to motion artefacts caused by organ movement and the central location of these regions. In recent years, advances in technology have significantly minimized these imaging artefacts and improved overall image quality [10,11,12]. Hou et al. compared tumor lengths of ESCC determined by various imaging modalities, including CT, T2-weighted MR (T2-MR), and diffusion-weighted imaging (DWI), with pathological lesion length. Their results showed that DWI had a stronger correlation with pathological lesion length than CT or T2 MRI, making it the most accurate imaging method among the methods studied [13]. Moreover, Vollenbrock et al. concluded that agreement in determination of the primary tumor with MR is comparable to that with FDG-PET/CT [14].
Radiotherapy is a cornerstone in the preoperative and definitive treatment of non-metastatic esophageal cancer [15,16,17,18]. However, a recently published landmark study has shown that perioperative chemotherapy improves the survival of patients with resectable esophageal adenocarcinoma (EAC) compared to preoperative chemoradiotherapy [19]. Consequently, radiotherapy remains crucial for the treatment of ESCC only. Accuracy is fundamental in radiotherapy as it may impact local control and the incidence of toxic adverse events [20,21]. Uncertainties in delineation of the target and consequent deviations from the treatment protocol may lead to impaired local control [22]. The aim of our study was to analyze the role of PET/CT and MR on the determination of the primary tumor in radiotherapy for ESCC.
2. Materials and Methods
The study received approval from the National Medical Ethics Committee (Approval No. 0120-620/2019/3) on 21 January 2020, and was conducted in compliance with the Declaration of Helsinki. It was also registered in the ClinicalTrials.gov database under the identifier NCT05611658. Written informed consent was obtained from all participants prior to their inclusion in the study.
2.1. Patients
Previously, we had conducted a prospective analysis of 23 patients with carcinoma of the esophagus (adenocarcinoma and squamous cell carcinoma) [23]. For this analysis, we included data from patients with ESCC only. Patients had to fulfill the following inclusion criteria: locally advanced ESCC, planned preoperative or definitive chemoradiotherapy and no contraindications for MR imaging.
2.2. PET/CT
In all patients, an FDG-PET/CT scan was performed in the treatment position for radiotherapy using a Siemens Biograph™ mCT 40 PET-CT simulator (Siemens Healthineers, Erlangen, Germany) according to a standard preparation protocol. The administered activity of 18F-FDG was 3.7 MBq/kg via intravenous injection. Approximately 60 min later, a CT scan was performed with the following parameters: 120 kV, 200 mAs, a rotation time of 1 s, a pitch of 0.8 and a slice thickness of 3 mm. Before the CT scan, an iodine-based intravenous contrast agent was administered. Subsequently, a PET scan was performed in 3-dimensional mode with a duration of 2 min per bed position.
2.3. MR
MR was performed with a 1.5T Optima™ MR450w GE MR simulator (General Electric, Chicago, IL, USA). Patients were scanned in the treatment position before the start of radiotherapy, without the use of an intravenous contrast agent. T2 images in the transverse plane with a slice thickness of 3 mm and DWI were obtained for each patient.
2.4. Tumor Determination
Five radiation oncologists, each with more than five years of experience in the treatment of esophageal carcinoma, determined the gross tumor volume (GTV) using the Eclipse™ planning system (Palo Alto, CA, USA). To ensure a uniform approach, an initial meeting was held to familiarize the observers with MR images of esophageal carcinoma. Under the guidance of an experienced radiologist, they practiced the delineation of GTV on MR images on two pilot cases. The observers were given relevant information about the location and characteristics of the tumor. The GTV was contoured separately based on several imaging modalities: CT, PET/CT, MR, CT fused with MR, and PET/CT fused with MR. All study images were anonymized to maintain objectivity. GTV was defined as the tumor visible on imaging that encompassed the entire circumference of the esophagus without including regional pathological lymph nodes. To reduce the likelihood of recall bias, contouring was performed with different imaging modalities at least two weeks apart.
During contouring on PET/CT images, observers initially delineated the GTV on CT images and then corrected the volume as needed using PET images. The GTV_PET_ was defined as 20% of the maximum standardized uptake value (SUV). The PET and CT images were fused, and the visible tumor was contoured as GTV, with adjustments made as needed to include GTV_PET_ (Figure 1).
2.5. Data Analysis
GTV volumes were measured for each observer, each patient and all imaging modalities. Average volumes were then calculated for each patient and imaging modality. The generalized conformity index (CIgen) was calculated to assess agreement in tumor determination. This index is independent of the number of volumes analyzed. A CIgen of 1 means perfect agreement between the observers, while CIgen = 0 means that there is no overlap between the delineations [24].
The CIgen was determined for each patient and then averaged across all patients for each imaging modality.
For the original prospective trial, we determined the sample size based on data from existing studies. We calculated that approximately 21 patients would be required, with a significance level of 0.05 and a statistical power of 0.8 [9,14,25]. For the purpose of this analysis, we included only 16 patients with ESCC form the original group. The Wilcoxon signed rank test was used to analyze statistically significant differences. For statistical analysis, we used Statistical Package for the Social Sciences, version 29.0 (SPSS Inc., Chicago, IL, USA).
3. Results
A total of 16 patients diagnosed with ESCC and treated at our institute between April 2020 and May 2021 were included in this additional analysis. Thirteen were men and three were women; the average age was sixty-one years. The characteristics of the patients are presented in Table 1.
Tumor delineation using PET/CT imaging resulted in significantly larger mean volumes (38.10 cm^3^) compared to CT (33.06 cm^3^, p = 0.004). Delineations based on MR were significantly smaller compared to CT (p = 0.007). Volumes based on PET/CT with MR were significantly larger than on CT alone (p = 0.044) (Table 2).
CIgen was significantly higher using PET/CT MR compared to CT (p < 0.001) and PET/CT (p = 0.002). Statistically significant higher agreement was seen for PET/CT compared to planning CT alone (p = 0.003). No statistically significant difference was seen between CT alone and MR (p = 0.796) or CT fused with MR (p = 0.49) (Table 3).
4. Discussion
Our analysis showed that PET/CT significantly improves interobserver agreement in determining the primary tumor of ESCC compared to CT, and the agreement is even higher when MR is added. These results are in contrast to our previous analysis, which also included patients with EAC. The highest interobserver agreement was observed for PET/CT fused with MR, whereas CT alone showed the lowest agreement. However, only the difference in CIgen between MR alone and CT alone was marginally significant (p = 0.048) [23]. Interestingly, studies examining interobserver agreement in assessing the primary tumor, mainly in EAC, also found no significant differences between PET/CT, MRI, and CT. Schreurs et al. analyzed interobserver variation of target volumes between PET/CT and CT in majority of EAC and found no significant difference [26]. Nowee et al. also found no difference in the delineation of GTV between PET/CT and CT in six patients with esophageal cancer, three of whom had EAC and three of whom had ESCC [9]. Another study analyzing the contouring of GTV on MR and PET/CT images in six patients with esophageal cancer (three EAC, three ESCC) showed no significant differences [14]. On the other hand, Toya et al. analyzed 10 patients with only ESCC in their study. They compared the delineation of GTV using PET/CT and CT and found higher agreement with PET/CT (p = 0.005) [8]. We can assume that ESCCs are more avid tumors for FDG than EACs, so tumor assessment on PET/CT is more accurate. In addition, the majority of EACs are tumors of the distal part of the esophagus, which is more prone to movement, and the borders of the tumor are more difficult to assess even with various imaging techniques. Interestingly, MR improves the agreement of tumor assessment in ESCC even more when added to PET/CT. We can explain this by the fact that PET/CT improves the assessment of the location of the tumor, while MR improves the accuracy of tumor margin determination due to its superior soft tissue contrast.
To our knowledge, our study was the first to compare all three imaging modalities to determine interobserver variability. The fusion of all three modalities showed the highest agreement. This could be due to the fact that the observers obtained the most information about the characteristics and extent of the tumor from all three imaging modalities together and therefore determined the tumor with greater agreement. In contrast, in two other similar studies, only one imaging modality was compared with the other, without fusion, and consequently no improvement in agreement was observed [9,14].
Our study has some limitations. Firstly, we did not compare the extent of primary tumors on different imaging techniques with histopathological specimens, making it impossible to determine which imaging method was closest to the “ground truth”. Interestingly, there was only one study that examined primary tumors and regional lymph nodes with PET/MR, PET/CT, MR, and contrast-enhanced CT (CECT), with postoperative pathology serving as the reference standard for assessing diagnostic accuracy. PET/MRI was superior to PET/CT, MRI and CECT [27].
Secondly, our study involved fewer observers compared to two similar studies. The study by Nowee et al. involved 20 physicians from 14 different centers, whereas the study by Vollenbrock et al. involved 10 physicians from two centers [9,14]. As a result, our group was less heterogeneous and there was a possibility that some observers learned from others. This may limit the validity of our results and may not fully reflect the potential for adoption in routine clinical practice.
Thirdly, we analyzed data from only 16 patients. In the original prospective study, the sample was calculated for the analysis of patients with EAC and ESCC. In this “post hoc” analysis, we included data from patients with ESCC only, so the results may not be robust enough. Therefore, to confirm our results, a study with larger number of patients is warranted. However, two studies that used similar methods and imaging modalities both analyzed data from only six patients [9,14]. Neither study found a statistically significant difference in interobserver variability, which could be due to the fact that not enough real-world clinical scenarios were included.
Fourthly, the delineation of primary tumors began with CT alone, followed by MR alone, PET/CT, CT fused with MR, and finally PET/CT fused with MR. The poorest agreement was observed with CT, while the highest agreement was seen with PET/CT fused with MR. This could be partly due to the fact that the observers were able to memorize the anatomical details of the cases during successive contouring sessions, despite a minimum interval of 14 days—often longer—between sessions. Notably, the entire contouring process was completed within a period of almost two years.
Despite these limitations, we can conclude that we found significantly higher agreement in the determination of the primary tumor with PET/CT compared to CT alone, and that the agreement was even higher when we added MR to PET/CT. Interobserver variability could have an important clinical impact as it may influence the occurrence of toxic side effects and local control. Studies investigating the impact of contouring uncertainties on radiation dose distribution help us to identify which differences might be clinically relevant. Several studies on different tumors have shown that interobserver variability has a significant impact on dose coverage to the target or organs at risk [20]. For example, in the treatment of cervical cancer, the effect of interobserver variability on the dose distribution of MR-guided brachytherapy was studied. They found that uncertainty in delineation affected the dose distribution to the target by ±5 Gy and to the organs at risk by ±2–3 Gy [28]. In breast cancer, interobserver variability had no effect on target coverage, while the percentage of lung volume irradiated with 20 Gy (V20) varied between 5% and 25%, heart V20 varied between 0% and 7% and heart V10 varied between 2% and 20% [29]. We found no studies in the literature examining the effects of contouring uncertainties on the dosimetric parameters of radiation treatment plans for esophageal cancer. This could be a starting point for further research, as late toxic effects after radiation treatment for esophageal cancer may lead to poorer survival. The incidence of grade 3 or higher cardiovascular complications following radiotherapy for esophageal cancer is estimated to be between 10.8% and 16.3% and may be related to the dose received by cardiac structures, particularly the coronary arteries [30,31,32]. Pulmonary complications are also frequently observed after radiotherapy of thoracic tumors. In patients treated for esophageal cancer with neoadjuvant chemoradiation therapy and surgery, the mean lung dose was most strongly associated with postoperative pulmonary complications, which occurred in 22.1% of cases [33]. Both cardiovascular and pulmonary complications can have a significant impact on patient survival and quality of life.
Esophageal cancer is not a very common tumor, but it has high mortality and morbidity [34]. Appropriate treatment, including radiotherapy, is important to achieve local control of the disease, which may also have an impact on overall survival. Since higher interobserver agreement could potentially improve local control, it seems reasonable to additionally use MR and PET/CT in routine practice to facilitate the determination of the primary tumor for radiotherapy treatment planning. As esophageal cancer is not a common cancer, this would not present a major financial burden in most centers. However, it should be emphasized that optimization of imaging protocols and further research are needed.
5. Conclusions
In conclusion, our study demonstrated that PET/CT fused with MR achieved the highest agreement in identifying the primary tumor in ESCC. Considering the fact that ESCC is a relatively rare cancer, adopting PET/CT and MR for radiotherapy treatment planning and primary tumor determination seems feasible.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Yang J. Luo G.Y. Liang R.B. Zeng T.S. Long H. Fu J.H. Xu G.L. Yang M.Z. Li S. Zhang L.J. Efficacy of Endoscopic Ultrasonography for Determining Clinical T Category for Esophageal Squamous Cell Carcinoma: Data From 1434 Surgical Cases Ann. Surg. Oncol.2018252075208210.1245/s 10434-018-6406-929667114 · doi ↗ · pubmed ↗
- 2Wang Y. Huang Y. Zhao Q.Y. Li X.Q. Wang L. Wang N.N. Wang J.Z. Wang Q. Esophageal wall thickness on CT scans: Can it predict the T stage of primary thoracic esophageal squamous cell carcinoma?Esophagus 20221926927710.1007/s 10388-021-00886-234642835 · doi ↗ · pubmed ↗
- 3Jiang C. Chen Y. Zhu Y. Xu Y. Systematic review and meta-analysis of the accuracy of 18F-FDG PET/CT for detection of regional lymph node metastasis in esophageal squamous cell carcinoma J. Thorac. Dis.2018106066607610.21037/jtd.2018.10.5730622778 PMC 6297400 · doi ↗ · pubmed ↗
- 4Lu J. Sun X.-D. Yang X. Tang X.-Y. Qin Q. Zhu H.-C. Cheng H.-Y. Sun X.-C. Impact of PET/CT on radiation treatment in patients with esophageal cancer: A systematic review Crit. Rev. Oncol./Hematol.201610712813710.1016/j.critrevonc.2016.08.01527823640 · doi ↗ · pubmed ↗
- 5GauthéM. Richard-Molard M. Cacheux W. Michel P. Jouve J.L. Mitry E. Alberini J.L. Lièvre A. Fédération Francophone de Cancérologie Digestive (FFCD) Role of fluorine 18 fluorodeoxyglucose positron emission tomography/computed tomography in gastrointestinal cancers Dig. Liver Dis.20154744345410.1016/j.dld.2015.02.00525766918 · doi ↗ · pubmed ↗
- 6Han D. Yu J. Yu Y. Zhang G. Zhong X. Lu J. Yin Y. Fu Z. Mu D. Zhang B. Comparison of 18F-Fluorothymidine and 18F-Fluorodeoxyglucose PET/CT in Delineating Gross Tumor Volume by Optimal Threshold in Patients with Squamous Cell Carcinoma of Thoracic Esophagus Int. J. Radiat. Oncol. Biol. Phys.2010761235124110.1016/j.ijrobp.2009.07.168119910143 · doi ↗ · pubmed ↗
- 7Zhong X. Yu J. Zhang B. Mu D. Zhang W. Li D. Han A. Song P. Li H. Yang G. Using 18F-Fluorodeoxyglucose Positron Emission Tomography to Estimate the Length of Gross Tumor in Patients with Squamous Cell Carcinoma of the Esophagus Int. J. Radiat. Oncol. Biol. Phys.20097313614110.1016/j.ijrobp.2008.04.01518538492 · doi ↗ · pubmed ↗
- 8Toya R. Matsuyama T. Saito T. Imuta M. Shiraishi S. Fukugawa Y. Iyama A. Watakabe T. Sakamoto F. Tsuda N. Impact of hybrid FDG-PET/CT on gross tumor volume definition of cervical esophageal cancer: Reducing interobserver variation J. Radiat. Res.20196034835210.1093/jrr/rrz 00430864652 PMC 6530614 · doi ↗ · pubmed ↗
