Predictive power of combined inflammatory markers and magnetic resonance imaging features for glioma grading using machine learning: a retrospective study
Yaohua Liu, Yuting Wang, Junle Zhu, Shuang Qin, Yi Hong, Saifei Sun, Qin Zhang, Qi Lv

TL;DR
This study shows that combining MRI scans and blood markers with machine learning can accurately predict the severity of brain tumors called gliomas.
Contribution
The novel contribution is integrating inflammatory markers with MRI features using machine learning to improve glioma grading accuracy.
Findings
The support vector machine model achieved an AUC of 0.92 and F1 score of 0.91 in distinguishing glioma grades.
Combining MRI features with inflammatory markers improved predictive performance for glioma grading.
Five-fold cross-validation confirmed the reliability of the model's performance.
Abstract
This study aimed to explore the potential of integrating magnetic resonance imaging (MRI) features with inflammatory markers to predict glioma grading using machine learning models. A total of 179 glioma patients were analyzed. The dataset was randomly split into a training set (75%) and an independent test set (25%) to test hyperparameter. To enhance reliability, five-fold cross-validation was also applied during the training phase. Key MRI features and inflammatory markers, including relative apparent diffusion coefficient (rADC), monocyte count, lymphocyte-to-CRP ratio (LCR) and hemoglobin-albumin-lymphocyte-platelet index (HALP), were extracted and used as inputs for multiple machine learning classifiers. Model performance was assessed using metrics such as area under the curve (AUC), accuracy, and F1 score. The support vector machine model exhibited superior predictive…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Glioma Diagnosis and Treatment · Brain Tumor Detection and Classification
Background
Gliomas, the most common primary tumors of the central nervous system, present a significant clinical challenge due to their high recurrence rates and poor overall survival. This is further compounded by the limited availability and often suboptimal effectiveness of current treatment options [1]. Traditionally, glioma prognosis has relied on invasive procedures such as surgical biopsy and histopathological evaluation. However, the inherent heterogeneity of gliomas can hinder the accuracy of pathological assessments. Recent advances in medical imaging and tumor biology have facilitated the development of non-invasive methods for prognostic evaluation [2].
Magnetic resonance imaging (MRI) has become a cornerstone in the diagnosis of brain tumors, offering critical insights into tumor morphology and, to some extent, metabolic activity. Among the key MRI-derived metrics, the contrast enhancement ratio (ER) and the apparent diffusion coefficient (ADC) from diffusion-weighted imaging (DWI) are particularly significant. ER reflects tumor vascularity and aggressiveness, whereas ADC indicates tumor cellularity and malignancy potential. The relative ADC (rADC), defined as the ratio of the lesion-focused ADC to that of the surrounding tissue, serves as a normalized marker of water molecule diffusion, reducing the influence of inter-individual variability. Both ER and rADC are integral to glioma grading, prognosis, and treatment monitoring, offering valuable insights into tumor biology and behavior [3, 4].
Moreover, emerging evidence indicates that hematological markers of systemic inflammation can serve as reliable predictors in the preoperative assessment of cancer. Elevated markers such as the neutrophil-to-lymphocyte ratio (NLR) have been associated with poor outcomes in patients with neurogliomas [5, 6]. The rationale for combining systemic inflammatory markers with radiologic features in glioma stems from the understanding that gliomagenesis and progression are influenced by both local tumor characteristics and the systemic inflammatory response [7]. Prior studies have demonstrated that the interplay between tumor-induced inflammation and angiogenesis, as reflected by MRI parameters like ER, can synergistically predict glioma grade and prognosis [8]. For instance, a study showed that combining NLR with MRI-derived texture features improved the accuracy of glioma grading compared to using either modality alone [9]. These findings suggest that integrating inflammatory markers with imaging data can provide a more comprehensive assessment of the tumor microenvironment and its impact on disease progression.
In this context, a feasibility study has highlighted the potential of using advanced imaging techniques and machine learning algorithms to enhance the prognostic evaluation of glioblastoma, demonstrating that such approaches can significantly improve predictive accuracy and clinical decision-making [10]. However, limited research has investigated the combined predictive value of inflammatory and nutritional markers in the pathological grading of gliomas, particularly in the context of advanced machine learning techniques. Furthermore, the feasibility of using such a combined approach to predict outcomes in specific glioma subtypes, such as glioblastoma, warrants further investigation. A recent study explored the use of radiomic features derived from MRI in conjunction with clinical data to predict survival in glioblastoma patients, demonstrating the potential of integrating multi-modal data for improved prognostication [11]. This highlights the need for further research to refine and validate these approaches in larger, more diverse patient cohorts.
The selection of non-invasive biomarkers is critical for accurately predicting the pathological grade of gliomas. Machine learning models are particularly well-suited for this task, as they can effectively analyze complex, multidimensional datasets and reveal subtle relationships among imaging parameters, inflammatory markers, and tumor prognosis. Therefore, this study aims to use artificial intelligence technology to seek its sensitive indicators and evaluate predictive ability in glioma grading by comprehensive analysis of clinical data, imaging characteristics and inflammatory markers.
Methods
Study participants
This study was a retrospective single-center study with continuous enrollment of glioma patients treated in the neurosurgery department of Shanghai Tongji hospital between January 2022 and December 2024. All pathological and clinical data were collected and verified by experienced physicians. Glioma grading was conducted according to the 2016 World Health Organization (WHO) Classification of Tumors of the Central Nervous System. The study was approved by the hospital’s ethics review board, and written informed consent was obtained from all patients before participation.
Inclusion criteria
The patient must be pathologically confirmed as having glioma (WHO grades I-IV); MRI imaging must be completed within three days prior to surgery with complete visual characteristics and data suitable for subsequent quantitative analysis; Comprehensive records of inflammatory markers and clinical features are required, with blood test results for inflammatory markers obtained within 48 hours before surgery; Complete follow-up data and survival outcome information must be provided.
Exclusion criteria
Image quality does not meet the analysis requirements (such as strong motion artifacts, missing sequences, severe artifacts, etc.);
History of radiotherapy, chemotherapy or other interventions known to alter the tumor microenvironment;
Having any of the following conditions that may significantly affect levels of inflammatory markers and not excluded or stabilized prior to blood sample collection: a) Acute or persistent infections (e.g., active meningitis, cerebrospinal fluid infections, systemic bacterial/viral infections, etc., uncontrolled by non-therapeutic interventions during blood testing), b) Immune/inflammatory diseases (e.g., rheumatoid arthritis, systemic lupus erythematosus, inflammatory bowel disease, ankylosing spondylitis, etc.), c) Major trauma or surgical history within two weeks prior to blood sample collection, d) Recent use of systemic glucocorticoids, immunosuppressants, or long-term anti-inflammatory drugs (diagnosed within two weeks prior to blood testing), e) Pregnancy or delivery period, f) Coexisting malignancies or chronic liver/kidney dysfunction that may interfere with inflammatory markers (diagnosed within four weeks prior to blood testing), g) Other conditions that may significantly affect inflammatory marker levels or, as determined by investigators, potentially compromise study outcomes.
Blood test
Peripheral blood samples were collected from all patients, and routine hematological analyses were performed. The measured or calculated parameters included white blood cell count, neutrophil count, lymphocyte count, platelet count, serum albumin concentration, and C-reactive protein (CRP) levels. Based on these values, several composite inflammatory and nutritional indices were derived, including NLR, derived NLR (dNLR), platelet-to-lymphocyte ratio (PLR), platelet-to-monocyte ratio (PMR), lymphocyte-to-monocyte ratio (LMR), lymphocyte-to-CRP ratio (LCR), neutrophil-to-CRP ratio (NCR), prognostic nutritional index (PNI), advanced lung cancer inflammation index (ALI), albumin-lymphocyte-platelet-CRP index (ALPC), systemic immune-inflammation index (SII), CRP-albumin-lymphocyte index (CAL), CRP/albumin ratio (CAR), and hemoglobin-albumin-lymphocyte-platelet index (HALP). The detailed formulas for calculating these indices are provided in the Appendix 1.
MRI
MRI data were acquired using a 3.0T SIEMENS Verio superconducting magnetic resonance scanner. The imaging protocol included T2-weighted imaging (T2WI), unenhanced and contrast-enhanced T1-weighted imaging (T1WI), and DWI. The acquisition parameters were as follows: for T2WI, repetition time (TR)/echo time (TE) = 4210/96 ms; for T1WI, TR/TE = 1530/9 ms, with a pixel matrix of 256 × 256, field of view (FOV) of 23 × 23 cm^2^, slice thickness of 5 mm, and an intersection gap of 1 mm. For contrast-enhanced imaging, 3 mL/s of gadopentetate dimeglumine was administered intravenously, followed by contrast-enhanced T1-weighted image acquisition. DWI was performed with TR of 5400 ms, TE of 94 ms, and b-values of 0 and 1000 s/mm^2^. Axial, coronal, and sagittal planes were scanned for all sequences.
Segmentation
This study primarily focuses on preoperative ET (Enhancing Tumor) as the main segmentation target. Edematous regions are identified using T2/FLAIR sequences as supplementary markers, with the entire tumor-edema boundary defined through ET delineation. The segmentation work in this study was performed using the hospital radiology PACS system. Two radiologists with experience in neurology radiology were compared after independent evaluation. If there was disagreement, the final consensus ROI was formed through consensus consultation. The segmentation tool utilized was the built-in ROI drawing/annotation function of the hospital PACS system, with ROI areas ranging from 15 to 50 mm^2^.
ER, rADC, and EI based on consensus ROI area were calculated by two physicians independently. The calculation results of the two radiologists were evaluated for intergroup consistency using the interclass correlation coefficient. When the ICC value is below 0.5, it indicates poor consistency strength in measurement results. An ICC value between 0.5 and 0.75 reflects moderate consistency strength. Values ranging from 0.75 to 0.9 demonstrate good consistency in measurement results. When the ICC value exceeds 0.9, it signifies excellent consistency strength.
Feature selection and preprocessing
To minimize heterogeneity bias among patients, all data were standardized using the Min-Max normalization method. Feature selection was performed using the analysis of variance (ANOVA) F-test to identify features with significant heterogeneity. Additionally, the Pearson correlation coefficient was applied to evaluate the relevance between each feature and the target variable.
Group classification
The selected features were used to predict glioma grades. Patients with grade 1 and 2 gliomas were categorized into the lower-grade group, whereas those with grade 3 and 4 gliomas were classified into the higher-grade group.
Classifier building and assessment
Three classical machine learning classifiers—k-nearest neighbors (kNN), support vector machines with a radial basis function kernel (r-SVM), and random forest (RF)—were constructed and compared.
Hyperparameter tuning
The dataset was initially split into training and test cohorts at a ratio of 3:1. Using accuracy as the evaluation metric, we tested different hyperparameters: KNN model with nNeighbors = [1: 40], SVM model with C = i × 5/100 (i= [1: 40]), and RF model with nEstimators = [1: 100]. We selected hyperparameters that demonstrated stable accuracy fluctuations as the final criteria.
Model development
Following feature selection and hyperparameter tuning, the selected features were used as inputs to develop machine learning classifiers. Each model was evaluated using five-fold cross-validation to ensure robust performance. The average predictive performance across the validation sets was assessed using accuracy and the area under the receiver operating characteristic (ROC) curve (AUC). The confidence interval (CI) is calculated by Gaussian distribution algorithm. Model stability was further quantified using the F1-score. The calibration evaluation utilized Brier Score, log loss, expected calibration error (ECE), and reliability curves. Additional evaluation metrics included sensitivity, specificity, positive predictive value, and negative predictive value. ROC curves were generated based on the cross-validation results to visualize classification performance. To assess clinical relevance, overall survival (OS) and progression-free survival (PFS) were analyzed using the Kaplan–Meier method, with differences between groups evaluated via the log-rank test.
All statistical analyses and machine learning model development were performed using Python version 3.11. Survival analyses were conducted using R version 4.3.1.
Results
General information
A total of 179 patients with gliomas were included in the final analysis. Detailed demographic and clinical characteristics are summarized in Table 1. The mean age of the patients was 52.09 (range: 3–86) years. Based on the WHO classification, gliomas were distributed as follows: grade I (n = 10), grade II (n = 22), grade III (n = 35), and grade IV (n = 112). The ICC values were all greater than 0.9. The enhancement index (EI), rADC, and ER were selected from the calculations of one radiologist. Table 1. Demographic and clinical characteristics of patients, along with ANOVA F-test results for feature selectionLow-Grade GliomaHigh-Grade GliomaP valuePatients(n)32147—Age37.28(5–81)55.31(3–86)0.000EI2.75(1.15–6.6)5.76(1.07–179.26)0.283rADC1.79(0.19–3.77)1.43(0.7–2.87)0.000ER1.48(0.87–3.14)1.64(0.8–3.19)0.118WBC6.46(4.03–10.7)7.15(3.58–25.14)0.207N3.93(1.99–8.71)4.74(1.36–23.81)0.139L1.95(1.1–3.55)1.78(0.18–4.5)0.216Plt229.59(142–357)210.72(78–401)0.118M0.77(0.24–5.8)0.47(0.1–1.11)0.011CRP3.15(0.2–61.87)5.72(0.2–218.02)0.524ALB40.1(34.1–48.9)39.04(26.4–50.4)0.108NLR2.3(0.83–7.82)4.08(0.52–132.28)0.366Dnlr1.31(1.19–1.91)1.42(1.09–7.39)0.258PLR129.52(57.36–324.55)141.9(34.36–1261.11)0.548PMR558.34(24.48–900)509.4(143.33–1883.33)0.327LMR4.6(0.26–8.9)4.17(0.16–14)0.266LCR5.47(0.02–17.75)3.25(0–17.9)0.008NCR8.55(0.14–18.2)7.21(0.06–49.3)0.436PNI49.85(39.6–62.35)47.93(33.65–67.8)0.063ALI559.54(98.1–1369.87)465.29(6.63–2125.61)0.136ALPC0.27(0–0.99)0.24(0–1.41)0.602SII565.47(170.97–2791.09)881.67(75.69–30027.06)0.480CAL0.07(0–1.65)0.27(0–18.98)0.536CAR0.09(0–1.81)0.16(0–7.12)0.543HALP0.01(0–0.02)0.01(0–0.09)0.017From age to HALP, the value stand for Mean(min-max)p-values are reported for each feature based on the ANOVA F-test^^p value < 0.05 N: Neutrophil count L: Lymphocyte count M: monocyte count Every abbreviation has been annotationed in Supplementary Material 1.
Feature selection
Based on the ANOVA F-test results (Table 1), significant variability was observed in several variables, including age, rADC, monocyte count, LCR, and HALP. Additionally, Pearson correlation coefficient analysis (Fig. 1) revealed multicollinearity or weak correlation with the target variable for certain features, specifically WBC, dNLR, PLR, NLR, CAL, and CAR, which were subsequently excluded. Consequently, the following features were retained for model training: rADC, monocyte count, LCR, and HALP. Fig. 1. Pearson correlation coefficient analysis used for feature selection
Hyperparameter tuning
For the KNN model, the number of neighbors was set to n_neighbors = 4. In the RF model, the number of trees was set to n_estimators = 80, and the minimum number of samples required to split an internal node was set to min_samples_split = 5. For the SVM model, the regularization parameter was set to C = 4. All other hyperparameters were kept at their default values. Figure 2 shows the variation of accuracy with hyperparameters. Fig. 2. The relationship curve between model accuracy and variation of hyperparameters. (a) KNN(k-nearest neighbor classification); (b) r-SVM(support vector machine); (c) RF(random forest)
Classification of low-grade (WHO I and II) and high-grade (WHO III and IV) gliomas
Evaluation of data feature differences between training and test cohorts
In the training and test cohorts used for 5-fold cross-validation, there were approximately 117 patients with high-grade glioma and 26 patients with low-grade glioma in the training cohorts, 30 patients with high-grade glioma and 6 patients with low-grade glioma in the test cohorts. The ANOVA F-test was conducted on the features between the training and test cohorts. Results are shown in the Table 2. Except for the fifth group HALP index p value was 0.04, the rest The p values were all greater than 0.05. Table 2. Five-fold cross-validation cohorts: low-grade glioma vs. high-grade gliomaTurntrain_LGGtrain_HGGtest_LGGtest_HGGrADC_pValMono_pValLCR_pValHALP_pVal1261176300.700.430.100.702261176300.260.580.150.213251187290.710.220.810.534251187290.120.540.870.805261186290.660.460.840.04train: training cohort; test: test cohort; _LGG: LGG count; _HGG: HGG count; _pVal: p value of ANOVA F test between training and test cohorts
Evaluation of the predictive performance of models
Based on the results of 5-fold cross-validation, all three machine learning models demonstrated strong predictive performance in both the training and test cohorts. The r-SVM classifier showed the best performance on the test set, achieving an AUC of 0.92, an accuracy of 0.84(CI: 0.78–0.90), a F1 score of 0.91, a brier score of 0.12 and a log score of 0.39. RF also performed well, with an AUC of 0.90, accuracy of 0.82(CI: 0.75–0.88), a F1 score of 0.89, a brier score of 0.13 and a log score of 0.41. In the decision analysis chart, all three models demonstrated excellent net treatment benefits. However, discrepancies were observed in their calibration curves. The SVM model exhibited significant fluctuations in calibration accuracy across different probability intervals. The RF model showed good calibration performance in medium-to-high prediction probability ranges but performed poorly in low probability zones. The KNN model demonstrated strong consistency between predicted probabilities and actual outcomes on its decision curve, indicating reliable predictive results. The performance metrics of all three classifiers across the cross-validation folds are summarized in Table 3. Figure 3 presents the ROC curves of the classifiers in the test cohort. Figure 4 presents the reliability diagram and decision curve in the test cohort. Fig. 3ROC curves for the three classifiers (KNN, r-SVM, and RF) based on five-fold cross-validation. (a) KNN; (b) r-SVM; (c) RFFig. 4The reliability diagram and decision curves for the three classifiers (KNN, r-SVM, and RF). (a) KNN; (b) r-SVM; (c) RFTable 3Five-fold cross-validation results for classification performance: low-grade glioma vs. high-grade glioma using KNN, SVM, and RF modelsKNNAccuracy_trainingAccuracy_testlower_testupper_testAUC_trainingAUC_testTP_trainingTN_trainingFP_trainingFN_trainingTP_testTN_testFP_testFN_testSensitivity_trainingSensitivity_testSpecificity_trainingSpecificity_testPPV_trainingPPV_testNPV_trainingNPV_testF1_score_trainingF1_score_testBrier_Score_testlog_Score_testECE_test10.870.830.770.890.950.9110916108273330.930.900.620.500.920.900.670.500.920.900.121.220.0320.890.830.770.890.960.9111116106282420.950.930.620.330.920.880.730.500.930.900.121.220.0130.900.830.770.890.960.8711315105282510.960.970.600.290.920.850.750.670.940.900.152.180.0640.890.720.650.800.970.831101788233460.930.790.680.430.930.850.680.330.930.820.183.150.0850.880.770.700.840.950.891091799252440.920.860.650.330.920.860.650.330.920.860.132.190.08min0.870.720.650.800.950.830.920.790.600.290.920.850.650.330.920.820.121.220.01mean0.880.800.730.860.960.880.940.890.630.380.920.870.700.470.930.880.141.990.05SVMAccuracy_trainingAccuracy_testlower_testupper_testAUC_trainingAUC_testTP_trainingTN_trainingFP_trainingFN_trainingTP_testTN_testFP_testFN_testSensitivity_trainingSensitivity_testSpecificity_trainingSpecificity_testPPV_trainingPPV_testNPV_trainingNPV_testF1_score_trainingF1_score_testBrier_Score_testlog_Score_testECE_test10.850.860.800.920.950.951175210301501.001.000.190.170.850.861.001.000.920.920.090.310.0820.890.830.770.890.940.9211611151300600.991.000.420.000.890.830.920.940.910.140.430.1230.850.830.770.890.950.911184210291601.001.000.160.140.850.831.001.000.920.910.130.410.0940.890.810.740.870.950.8711710151281610.990.970.400.140.890.820.910.500.940.890.140.450.0550.850.860.800.910.940.931176201282410.990.970.230.330.850.880.860.670.920.920.110.370.15min0.850.810.740.870.940.870.990.970.160.000.850.820.860.500.920.890.090.310.05mean0.870.840.780.900.950.920.990.990.280.160.860.840.940.790.930.910.120.390.10RFAccuracy_trainingAccuracy_testlower_testupper_testAUC_trainingAUC_testTP_trainingTN_trainingFP_trainingFN_trainingTP_testTN_testFP_testFN_testSensitivity_trainingSensitivity_testSpecificity_trainingSpecificity_testPPV_trainingPPV_testNPV_trainingNPV_testF1_score_trainingF1_score_testBrier_Score_testlog_Score_testECE_test10.970.810.740.871.000.891172150281521.000.930.810.170.960.851.000.330.980.890.130.380.0920.970.830.770.891.000.921172240291511.000.970.850.170.970.851.000.500.980.910.120.360.1430.960.830.770.891.000.921181960291601.001.000.760.140.950.831.001.000.980.910.110.360.1140.990.720.650.801.000.831182320251641.000.860.920.140.980.811.000.200.990.830.180.580.1850.970.890.830.941.000.921182240283311.000.970.850.500.970.901.000.750.980.930.100.350.14min0.960.720.650.801.000.831.000.860.760.140.950.811.000.200.980.830.100.350.09mean0.970.820.750.881.000.901.000.950.840.220.970.851.000.560.980.890.130.410.13_training: training cohort; _test: test cohortlower_test: lower confidence interval of test cohort accuracyupper_test: upper confidence interval of test cohort accuracyTP: true HGG; TN: true LGG; FP: false HGG; FN: false LGG; ECE: expected calibration error
Survival analysis of low-grade (WHO I and II) and high-grade (WHO III and IV) groups
The median OS and PFS for the entire cohort were 15 months and 8 months, respectively. Figure 3 illustrates the Kaplan–Meier survival curves for OS and PFS in the low-grade and high-grade groups. Patients in the low-grade group demonstrated significantly longer median OS than those in the high-grade group (60 months vs. 15 months; p < 0.0001; Fig. 5a) as well as significantly longer median PFS (60 months vs. 8 months; p < 0.0001; Fig. 5b). Fig. 5. Kaplan-Meier survival curves comparing low-grade and high-grade glioma groups. a OS; b PFS
Discussion
The tumor microenvironment in gliomas is often characterized by a sustained state of chronic inflammation, which enables the use of peripheral inflammatory markers to predict tumor behavior and progression [12–14]. MRI remains a critical tool in the diagnosis and evaluation of brain gliomas, with specific parameters such as the rADC and ER serving as important indicators of tumor type and grade [5, 15–17]. In this study, we found that combining radiological features with hematological and inflammatory parameters significantly improved the accuracy of glioma grade prediction. Notably, variables such as age, rADC, monocyte count, LCR, and HALP showed clear discriminatory power between low-grade and high-grade gliomas.
The discriminatory power of these features can be explained by their underlying biological and clinical relevance. Age is a well-established prognostic factor in gliomas, with older patients generally experiencing poorer outcomes [18]. This may be due to age-related changes in the tumor microenvironment, such as increased inflammation and decreased immune surveillance. The rADC, reflecting tumor cellularity and water diffusion, is inversely correlated with glioma grade, with lower rADC values indicating higher cellularity and malignancy [19]. Monocytes, as key components of the innate immune system, contribute to glioma-associated inflammation and angiogenesis, promoting tumor growth and invasion [20]. The lymphocyte-to-C-reactive protein ratio (LCR) and the hemoglobin-albumin ratio-lymphocyte percentage (HALP) are composite inflammatory markers that reflect the balance between systemic inflammation and nutritional status, both of which are known to influence glioma progression [21].
In the tumor microenvironment, glioma pathogenesis involves complex and dynamic interactions between cancer cells and the surrounding stroma, including glial and immune cells. Among the immune components, neutrophils and lymphocytes play a central role in modulating the inflammatory response [1, 22]. In our study, LCR and HALP exhibited significant differences between the high-grade and low-grade glioma groups. These findings are consistent with previous studies demonstrating the prognostic value of systemic inflammatory markers in gliomas [23]. However, our study extends these findings by demonstrating the combined predictive power of LCR and HALP with radiological features, suggesting that a multi-modal approach can provide a more comprehensive assessment of glioma biology.
SVM is a classical supervised learning algorithm. Its core idea is to find an optimal hyperplane (a straight line in two-dimensional space) to maximize the interval between different categories of data points, so as to improve the generalization ability of the model [24]. In this study, the SVM model exhibited superior predictive performance and stability compared with the KNN and RF models in distinguishing between high-grade and low-grade gliomas. This suggests that SVM is well-suited for analyzing complex, high-dimensional datasets and identifying non-linear relationships between features and outcomes.
This study advances current knowledge by demonstrating the feasibility of using a combination of readily available clinical, radiological, and hematological parameters to predict glioma grade with high accuracy. This approach addresses the gap in current clinical practice, where glioma grading relies primarily on invasive surgical biopsy and histopathological evaluation, which are subject to sampling bias and inter-observer variability. Our findings suggest that non-invasive biomarkers can potentially complement or even replace invasive procedures in certain clinical scenarios.
The potential clinical implications of our findings are significant. Accurate preoperative prediction of glioma grade can inform treatment planning, allowing clinicians to tailor therapy to the individual patient’s risk profile. For example, patients with predicted high-grade gliomas may benefit from more aggressive treatment strategies, such as maximal surgical resection followed by adjuvant chemoradiation. Conversely, patients with predicted low-grade gliomas may be managed with less aggressive approaches, such as observation or limited resection, to minimize treatment-related morbidity. Furthermore, our findings may facilitate the development of personalized risk stratification models that can be used to predict prognosis and guide clinical decision-making.
This study has some limitations. First, all patients were recruited from a single institution, which may limit the generalizability of the findings. Second, the retrospective nature of the analysis may introduce selection bias. Additionally, the relatively small number of imaging datasets prevented external validation, which is essential for confirming the robustness and applicability of the proposed models. Future research directions include external validation of our findings in multi-center cohorts, as well as the incorporation of molecular and genetic markers, such as IDH1 mutation status and MGMT promoter methylation, to further refine the predictive accuracy of our models. We also plan to explore the integration of other imaging modalities, such as perfusion MRI and MR spectroscopy, to provide a more comprehensive assessment of glioma biology. Finally, we aim to develop a user-friendly clinical decision support tool that can be used by clinicians to predict glioma grade and guide treatment planning.
Conclusions
The combination of MRI features and inflammatory markers, analyzed through machine learning models like SVM, holds significant potential for refining glioma prognosis and guiding personalized treatment strategies.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Supplementary Material 1
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Foltyn-Dumitru M, Schell M, Sahm F, Kessler T, Wick W, Bendszus M, Rastogi A, Brugnara G, Vollmuth P. Advancing noninvasive glioma classification with diffusion radiomics: exploring the impact of signal intensity normalization. Neurooncol Adv. 2024, Mar, 22;6(1): vdae 043. 10.1093/noajnl/vdae 043. PMID: 38596719; PMCID: PMC 11003539.10.1093/noajnl/vdae 043PMC 1100353938596719 · doi ↗ · pubmed ↗
