Integrative Predictive Nomograms for Treatment Decision-Making in Resectable Synchronous Colorectal Liver Metastases
Yujuan Jiang, Dedi Jiang, Jinghua Chen, Heting Feng, Zixing Zhu, Jun Jiang, Fan Wu, Jianwei Liang

TL;DR
This study creates a web-based tool to help doctors decide whether to perform surgery or neoadjuvant therapy for colorectal liver metastases patients.
Contribution
The study integrates multiple clinical factors into predictive nomograms and provides a web-based application for practical use.
Findings
The nomograms included nine predictors and showed good consistency in survival predictions.
The C-index and ROC curve results indicated moderate predictive accuracy for overall and disease-free survival.
A web-based application was developed to facilitate the use of these prediction models in clinical practice.
Abstract
Background: Currently, there is no established standard for managing resectable synchronous colorectal liver metastases (CRLM): upfront surgery or neoadjuvant therapy. This study has integrated four available clinical factors - clinicopathological characteristics, gene mutation profiles, imaging findings, and hematological indicators - to create a potentially robust tool aiding clinicians in deciding between upfront surgery and neoadjuvant therapy. Methods: This retrospective cohort study included individuals diagnosed with resectable synchronous CRLM between 2008 and 2018. The development of prediction nomograms entailed identifying independent prognostic indicators through univariate and multivariate Cox analyses. The accuracy of the predictions was evaluated through calibration curves and the C-index. Furthermore, the clinical effectiveness of the nomograms was assessed using DCA…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging
Introduction
Colorectal cancer (CRC) ranks as the third most common cancer globally, with the second-highest mortality rate1. The liver is the most prevalent site for CRC metastases, presenting a challenging prognosis for patients, as evidenced by a 5-year survival rate of less than 10%1. Liver resection is the recommended strategy for patients with colorectal liver metastases (CRLM), endorsed clinically and by the National Comprehensive Cancer Network (NCCN) guidelines as the optimal approach for long-term survival and potential cure2. Moreover, preoperative therapy offers a spectrum of potential advantages, such as timely intervention for micrometastases, assessment of treatment efficacy, and the avoidance of local therapy for patients exhibiting early disease progression indicators2-4. However, the choice between neoadjuvant therapies and upfront surgery remains contentious.
The European Society for Medical Oncology (ESMO)5 recommends that oncological (prognostic) and technical (surgical) criteria be taken into account when evaluating upfront surgery or neoadjuvant treatment. However, neither ESMO nor NCCN provides definitive screening criteria2, 5, which results in a heavy reliance on the doctor's expertise. Consequently, there is a growing trend towards risk stratification and the identification of “high-risk” patients6. However, persistent issues remain in the field of predictive modeling. Some models only present the variables identified in screening without undergoing validation7-9 while others focus exclusively on clinicopathological features, overlooking important predictors such as the status of KRAS and BRAF mutations10.
In addressing the above clinical concerns, this study constructed and validated two evolving preoperative predictive models. It scrutinized nine variables across four domains encompassing clinicopathological features, gene mutation status, imaging evidence, and hematological parameters. By computing patients' survival risk scores, this approach can potentially aid physicians in determining the suitability of upfront surgery for resectable synchronous CRLM.
Materials and Methods
Patient selection
A total of 815 patients diagnosed with CRLM between 2008 and 2018 at the National Cancer Center were included in this retrospective cohort study. Eligibility criteria comprised the following: (1) aged between 20 and 85 years, (2) diagnosis of resectable synchronous CRLM between 2008 and 2018, and (3) undergone radical surgical intervention. Exclusion criteria entailed (1) patients with unknown survival status, (2) individuals who underwent neoadjuvant therapy, (3) patients with extrahepatic metastasis, and (4) patients lacking clear clinicopathological characteristics, imaging examinations, or gene mutation information. A total of 386 patients were recruited for the study. Figure 1 shows the patient screening procedure. The research was retrospectively registered. Patient information was obtained by telephone follow-up (once a year) or review records. The National Cancer Center's Institute Research Medical Ethics Committee approved this study (NCC2021C-125). Following review by the National Cancer Center's Institute Research Medical Ethics Committee, our study was granted an exemption from obtaining signed informed consent.
Prognostic variables
The study gathered data on various parameters including the size of the largest metastatic tumor, number of metastatic tumors, clinical T(cT) and N(cN) stages, KRAS, NRAS, and BRAF statuses, gender, age, body mass index (BMI), American Society of Anesthesiologists (ASA) physical status classification, tumor grade, histological type, primary tumor site, pretreatment carcinoembryonic antigen (CEA) levels, neutrophil, lymphocyte, monocyte, and platelet counts, neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio (PLR), C-reactive protein (CRP), albumin (Alb), Glasgow Prognostic Score (GPS), red blood cell distribution width (RDW-CV), and D-Dimer levels. The primary outcome of the study was overall survival (OS), defined as the period from the date of diagnosis until death from any cause or the last follow-up. Disease-free survival (DFS) was designated as the secondary endpoint. Patients were divided into two age groups: <70 years old and ≥70 years old. The patients were then categorized based on the following criteria: the largest size of metastatic tumors (<2.25cm and≥2.25cm), the number of metastatic tumors (<3 and≥3), CEA levels (<16.110ng/ml and≥16.110ng/ml), neutrophil count (<3.43510^9^ and≥3.43510^9^), lymphocyte count (<2.79510^9^ and≥2.79510^9^), monocyte count (<0.54510^9^ and ≥0.54510^9^), platelet count (<282.510^9^ and≥282.510^9^), NLR(<2.677 and≥2.677), PLR(<130.481 and≥130.481), CRP levels (<0.085mg/L and ≥0.085mg/L), albumin levels (<44.95g/dl and ≥44.95g/dl), RDW-CV (<15.35 and ≥15.35), and DD levels (<0.545mg/L and ≥0.545mg/L), based on the optimal cut-off value obtained from the receiver operating characteristic (ROC) curve (refer to Supplementary Figure 1).
Statistical analysis
Statistical analysis was conducted utilizing the R program, version 4.3.2, available for download at https://www.r-project.org/. The patients were randomly assigned: 70% were designated to the training cohort, and the remaining 30% were allocated to the validation cohort, utilizing the “caret” package in R. Categorical variables are shown as numbers and percentages. Variables with a significance level of P<0.1 were initially selected using univariate Cox regression analysis, followed by multivariate Cox regression analysis to identify independent factors (P<0.05) influencing OS and DFS. Subsequently, two prognostic nomograms were developed, utilizing distinct predictive variables to predict OS and DFS in patients with resectable synchronous CRLM. Dynamic nomogram models were developed using the “rms” and “Dynnom” packages. We employed the Harrell consistency index (C-index), calibration curve, Receiver operating characteristic (ROC) curves and decision curve analysis (DCA) to assess the model's accuracy and appraise its clinical utility. Based on the prediction model, the risk score for each patient was computed, and the optimal cut-off value from the ROC curve was utilized to stratify patients into high-risk and low-risk groups. Subsequently, Kaplan-Meier survival curves were employed to assess the disparities in OS and DFS between these groups. A significance level of p < 0.05 (two-tailed) was applied to all statistical tests.
Results
Patient characteristics
This study included a total of 386 patients with resectable synchronous CRLM, among whom 270 patients (70%) were randomly allocated to the training group, with the remaining 116 patients (30%) assigned to the test group. The median OS and DFS for the entire patient cohort were 40.50 months (IQR, 23.25-61.00) and 25 months (IQR, 9.25-55), respectively. Table 1 displays the detailed data. The majority of patients (57.3%) were under the age of 70. Adenocarcinoma (98.7%) was the predominant pathological type, mostly highly differentiated. Most patients (74.1%) tested negative for the KRAS gene, with similar findings on the NRAS and BRAF genes. Among the sample, 281 individuals (72.8%) had fewer than three metastases. The most common cT stage observed was T3, accounting for 213 cases (55.2%). The left colon was the primary site for 289 patients (74.9%).
Independent influencing factors of OS and DFS were selected
The univariate Cox regression analysis revealed that the number of metastatic tumors, cN stage, KRAS and BRAF gene mutations, patient age, primary tumor site, CEA levels, neutrophil count, platelet count, and D-Dimer levels exhibited a significance level of P <0.1. Hence, these variables demonstrated a significant correlation with patients' OS. The results of the multivariate analysis indicated that the number of metastatic tumors (HR 2.46, P < 0.001), cN stage (P < 0.001), KRAS mutation (HR1.66, P < 0.05), BRAF mutation (HR 4.58, P < 0.001), age (HR 1.62, P < 0.05), primary tumor site (HR 1.61, P < 0.05), neutrophil count (HR 1.80, P < 0.05), platelet count (HR 2.53, P < 0.05), and D-Dimer levels (HR 1.62, P < 0.05) were the factors significantly associated with OS (Table 2). Supplementary Figure 2 presents a forest plot illustrating the HRs and 95% CIs for OS determined through Cox proportional hazards regression analysis.
In predicting DFS, factors such as the number of metastatic tumors, cN stage, KRAS and BRAF mutations, patient age, primary tumor site, CEA levels, neutrophil count, monocyte count, platelet count, and D-Dimer levels were identified as significant predictors for DFS using univariate analysis. The subsequent multivariate analysis revealed that the number of metastatic tumors (HR 2.48, P < 0.001), cN stage (HR 3.32, P < 0.001), KRAS (HR 1.74, P < 0.05), BRAF (HR 2.81, P < 0.001), age (HR, P < 0.05), original site (HR 1.83, P < 0.05), neutrophil count (HR 1.61, P < 0.05), platelet count (HR 2.58, P < 0.05), D-Dimer (HR 1.62, P < 0.05) were identified as independent predictors. These findings are presented in **Table 3. **Moreover, a forest plot from the multivariate regression analysis for DFS was generated in Supplementary Figure 3.
Development of nomograms for OS and DFS
Key screening indicators, including the number of metastatic tumors, cN stage, mutations in KRAS and BRAF genes, patient age, primary tumor site, neutrophil count, platelet count, and D-Dimer levels, were integrated into the dynamic prediction model to forecast OS and DFS in patients with resectable synchronous CRLM. Figure 2 illustrates the process of determining a patient's score for each variable by intersecting a vertical line with the factor value score line. The cumulative total score corresponds to various survival rates. This visual model enables an intuitive assessment of the likelihood of 1-year, 3-year, and 5-year survival as well as disease-free survival.
Moreover, we have created an interactive online application featuring the generated nomograms (Refer to Figure 3). The hyperlinks (https://alxpcun.shinyapps.io/OS-nomogram/and https://alxpcun.shinyapps.io/DFS-nomogram/)are accessible for navigation.
Validation of the nomograms
The predictive performance of the models in both the training and testing cohorts was evaluated through calibration plots, C-index, AUC, and DCA. Initially, calibration curves were generated to compare the projected 1-, 3-, and 5-year OS and DFS probabilities from the nomogram models with the actual outcomes of resectable synchronous CRLM patients in the training and testing sets. The results demonstrated strong concordance between the predicted probabilities and the observed outcomes, validating a high level of accuracy in the predictions, as depicted in Figure 4. The nomogram exhibited substantial precision in survival prediction, with C-index values of 0.773 ± 0.05 and 0.764 ± 0.053 for OS and DFS, respectively, in the training cohort. In the testing group, the C-index values were 0.747 ± 0.085 and 0.741 ± 0.082 for OS and DFS, respectively. ROC curves were constructed to evaluate the sensitivity and specificity of the nomogram prediction models. The training cohort displayed AUC values of 0.83, 0.83, and 0.77 for OS at 1, 3, and 5 years, respectively, as well as AUC values of 0.81, 0.82, and 0.80 for DFS at the corresponding time points. Remarkably, the testing cohort displayed AUC values of 0.76, 0.71, and 0.78 for OS and 0.78, 0.71, and 0.82 for DFS at 1, 3, and 5 years respectively, as illustrated in*** Figure 5***. In summary, our data validated the high sensitivity and specificity of our nomogram models.
Additionally, DCA serves as a widely adopted method for evaluating the clinical usefulness of nomograms. The nomograms surpass the conventional TNM staging system, demonstrating superior predictive capabilities for mortality risk, as depicted in Figure 6. These results emphasize the considerable practical importance of nomograms in predicting OS and DFS for patients diagnosed with resectable synchronous CRLM.
Risk status of patients stratified by the prediction models
The predictor variable scores were calculated using the nomogram and then totaled to determine the cumulative scores for each patient. Patients with resectable synchronous CRLM were stratified into low- and high-risk categories based on their nomogram scores. The threshold score was set at 295.4 points for DFS and 283.5 points for OS on the nomogram. Patients exceeding the threshold were classified as high risk. Survival analysis revealed a significantly lower probability of DFS and OS in the high-risk group compared to the low-risk group (both P < 0.001). These results indicate that the nomograms utilized in this study can effectively stratify the risk for resectable synchronous CRLM patients (Figure 7).
Discussion
NCCN and ESMO guidelines endorse surgical intervention or perioperative (neoadjuvant plus postoperative) systemic therapy for patients with resectable synchronous colorectal liver metastases (CRLM)2, 5. However, the optimal sequence for administering systemic therapy and resection remains uncertain. The decision between neoadjuvant therapy and upfront surgery for resectable synchronous CRLM varies greatly among medical centers and is largely subjective. Present predictive models for resectable synchronous CRLM predominantly focus on postoperative survival prognostication rather than assisting in the choice between upfront surgery or neoadjuvant therapy. They also disregard vital hematological and genetic factors, such as neutrophil count, platelet count, and KRAS status, which are known to significantly influence patient survival11, 12. Thus, further research is imperative to investigate the factors influencing the prolonged survival of resectable synchronous CRLM patients and to create predictive models for optimizing treatment selection.
In order to enhance the comprehensiveness and accuracy of the model, this study introduced four indices - clinicopathological features, genetic status, imaging, and hematological factors - for the first time. Subsequently, univariate and multivariate Cox regression analyses were conducted using comprehensive clinical data to identify independent risk variables for the prognosis of OS and DFS in patients with resectable synchronous CRLM. Nine prognostic factors were then identified: metastatic tumor count, cN stage, KRAS and BRAF status, patient age, primary tumor location, neutrophil and platelet counts, and D-Dimer level. Furthermore, an interactive online tool was developed to expedite clinical decision-making based on the provided nomograms. The calibration of the nomograms demonstrated strong performance, with the OS and DFS models exhibiting a C-index and AUC greater than 0.7, indicating their robust discriminative ability. Additionally, DCA illustrated that our novel nomogram models offered superior clinical utility compared to the traditional TNM staging system across various threshold probabilities. These results suggest that our nomograms could aid clinicians in deciding between upfront surgery and neoadjuvant therapy.
In recent years, various studies have highlighted the significance of the primary tumor site, lymph node metastasis status, age, and the number of metastatic tumors as key prognostic factors for patients with resectable synchronous CRLM, leading to their incorporation into clinical prediction models13, 14. Our research aligns with these findings. Although there are differences in how these variables are grouped, they consistently emerge as independent risk factors affecting the outcomes of resectable synchronous CRLM patients. Busiman et al.15 integrated several factors including age, gender, location and nodal status of CRC, disease-free interval, number and size of CRLM, preoperative CEA levels, resection margin, presence of extrahepatic disease, KRAS and BRAF mutation status, histopathological growth pattern, perioperative systemic chemotherapy, and perioperative hepatic arterial infusion pump (HAIP) chemotherapy into their clinical prediction model using Cox regression analysis, yielding satisfactory predictive performance. It is noteworthy that their model focused on predicting the 10-year survival post-surgery for patients with CRLM, rather than aiding in preoperative surgical decision-making, and did not consider the predictive value of hematological factors. These limitations are also prevalent in other related studies16-18.
Genetic factors, such as the status of BRAF and KRAS, significantly impact the treatment recommendations for patients with resectable synchronous CRLM11, 19. Previous models were limited by data availability, hindering the incorporation of all relevant variables. Our models emphasize the crucial roles of KRAS and BRAF in predictive modeling. Mutations in KRAS and BRAF promote cancer progression by enhancing angiogenesis, influencing cell motility and adhesion, and triggering aggressive biological traits in cancer cells20, 21. As a result, patients demonstrate significantly decreased OS and early tumor recurrence. Our findings are consistent with the study by Huang et al.22, confirming that KRAS and BRAF mutations act independently as prognostic risk factors in resectable synchronous CRLM patients. While the mutation status significantly influenced the outcome in the prediction model, the mutation rates of KRAS and BRAF were notably low, diminishing their clinical predictive value23. Additionally, increasing research supports the use of hematological markers such as neutrophil count, platelet count, and NLR in predicting the presence of resectable synchronous CRLM24-26, thereby strengthening prediction models. Neutrophils play a significant role in the progression and spread of CRLM, underscoring their clinical relevance. Neutrophils secrete interleukin-1 (IL-1) and tumor necrosis factor, stimulating the production of granulocyte-colony stimulating factors, resulting in elevated levels of vascular endothelial growth factor, antiapoptotic markers, and transcription factors12. Nevertheless, further research is necessary to fully elucidate the underlying mechanism. In a study by Pedrazzan22 et al., it was revealed that patients with a high platelet count experience reduced long-term survival rates. The study further emphasized that a high platelet count independently acts as a prognostic factor for patients who underwent potentially curative resections, supporting our own findings. Despite these similarities, the exact mechanism remains uncertain, with several potential explanations requiring empirical validation27-30. Multiple studies have demonstrated that D-Dimer affects cellular communication systems, promoting increased cell proliferation and initiating angiogenesis31. Additionally, it has been observed to facilitate the progression and spread of malignancies by enhancing tumor cell attachment to endothelial cells32. Chen et al.32 have shown a correlation between elevated D-Dimer levels and decreased OS and DFS, consistent with our own research results. Furthermore, we incorporated D-Dimer levels into the clinical prediction model.
Our clinical predictive models offer several advantages. Firstly, patients with resectable synchronous CRLM were classified into high-risk and low-risk groups based on optimal cutoff values. Substantial differences in OS and DFS were noted across the risk categories. By using a risk score, physicians can tailor treatment strategies for resectable synchronous CRLM patients, aiding in the decision-making process between upfront surgical intervention and neoadjuvant therapy. Secondly, our nomograms incorporate a range of factors, including clinicopathological characteristics, genetic status, radiological findings, and preoperative blood parameters, providing a comprehensive evaluation. Moreover, the prognostic factors considered are readily available through routine preoperative examinations and blood tests, enabling healthcare professionals to personalize treatment plans for each patient.
Nevertheless, our study does have limitations. The retrospective design of the research poses a constraint that could lead to recall bias. Implementing a multicenter prospective cohort study would offer further evidence to substantiate our model. Moreover, to mitigate selection bias, we excluded patients with ambiguous data in their variables. For additional assurance, it may be necessary to validate the nomograms employed in this study through confirmation from cohorts at diverse medical centers.
Conclusion
Our study was groundbreaking as it identified nine preoperatively accessible indicators from four critical domains: clinicopathological features, gene status, imaging, and hematology factors. These indicators were instrumental in developing a predictive model specifically tailored for direct surgery in resectable synchronous CRLM patients. The model has demonstrated exceptional accuracy in guiding treatment selection. In summary, we present a novel predictive model that estimates the survival rates of resectable synchronous CRLM patients who have undergone direct surgery using easily obtainable clinical indicators. This tool is anticipated to significantly aid clinicians in treatment decision-making processes between upfront surgery and neoadjuvant therapy.
Supplementary Material
Supplementary figures and tables.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bray F Laversanne M Sung H Ferlay J Siegel RL Soerjomataram I Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries CA Cancer J Clin 202474229633857275110.3322/caac.21834 · doi ↗ · pubmed ↗
- 2Benson AB Venook AP Al-Hawary MM Arain MA Chen YJ Ciombor KK Colon Cancer, Version 2.2021, NCCN Clinical Practice Guidelines in Oncology J Natl Compr Canc Netw 202119329593372475410.6004/jnccn.2021.0012 · doi ↗ · pubmed ↗
- 3Araujo RLC Fonseca LG Silva RO Linhares MM Uson Junior PLS Molecular profiling and patient selection for the multimodal approaches for patients with resectable colorectal liver metastases Hepatobiliary Surg Nutr 202413273923861747910.21037/hbsn-22-616PMC 11007353 · doi ↗ · pubmed ↗
- 4Sebagh M Allard MA Cunha AS Ruiz A Araujo R Lemoine AA proposed new method for assessing the pathological response to chemotherapy in resected colorectal liver metastases Br J Cancer 201411147062507230310.1038/bjc.2014.299PMC 4119974 · doi ↗ · pubmed ↗
- 5Van Cutsem E Cervantes A Adam R Sobrero A Van Krieken JH Aderka DESMO consensus guidelines for the management of patients with metastatic colorectal cancer Ann Oncol 20162713864222738095910.1093/annonc/mdw 235 · doi ↗ · pubmed ↗
- 6Kokkinakis S Ziogas IA Llaque Salazar JD Moris DP Tsoulfas G Clinical Prediction Models for Prognosis of Colorectal Liver Metastases: A Comprehensive Review of Regression-Based and Machine Learning Models Cancers (Basel)20241616453873059710.3390/cancers 16091645 PMC 11083016 · doi ↗ · pubmed ↗
- 7Chen F-L Wang Y-Y Liu W Xing B-C Prognostic factors in colorectal liver metastases patients with various tumor numbers treated by liver resection: a single-center, retrospective study World J Surg Oncol 2022202373585436110.1186/s 12957-022-02700-4PMC 9297581 · doi ↗ · pubmed ↗
- 8Fang C Huang Y Chen C Nie D Lin J Xiao Z The Prognostic Value of Serum Apolipoprotein A-I Level and Neutrophil-to-Lymphocyte Ratio in Colorectal Cancer Liver Metastasis J Oncol 2022202291497883620417710.1155/2022/9149788 PMC 9532097 · doi ↗ · pubmed ↗
