OMT and tensor SVD–based deep learning model for segmentation and predicting genetic markers of glioma: A multicenter study
Zhengyang Zhu, Han Wang, Tiexiang Li, Tsung-Ming Huang, Huiquan Yang, Zhennan Tao, Zhong-Heng Tan, Jianan Zhou, Sixuan Chen, Meiping Ye, Zhiqiang Zhang, Feng Li, Dongming Liu, Maoxue Wang, Jiaming Lu, Wen Zhang, Xin Li, Qian Chen, Zhuoru Jiang, Futao Chen, Xin Zhang, Wen-Wei Lin

TL;DR
This study introduces a deep learning model using OMT and tensor SVD to accurately segment and predict genetic markers of glioma from MRI scans, outperforming radiologists.
Contribution
A novel deep learning model combining optimal mass transport and multimode tensor SVD for glioma segmentation and genetic marker prediction.
Findings
The OMT segmentation model achieved a mean Dice score of 0.880 for tumor regions.
The OMT-APC model outperformed four radiologists in predicting WHO grade, IDH mutation, and 1p/19q codeletion with high accuracy and AUC scores.
The model demonstrated strong generalizability across 16 multicenter datasets from Asia, Europe, and America.
Abstract
Accurate characterization of glioma is essential for effective clinical decision-making. Most current studies involve a limited number of patients and focus solely on single-gene tasks. This research introduces a novel deep learning model based on OMT and multimode tensor SVD to predict molecular markers using international multicenter datasets. Our approach efficiently compresses irrelevant information while enhancing tumor-region features through OMT. Additionally, we innovatively integrate an algebraic preclassification model, derived from multimode tensor SVD, with deep learning networks. This combination significantly improves the model’s ability to recognize tumor and classify genetic subtypes. Experimental validation on multicenter datasets demonstrates that our method is highly reproducible and generalizable, offering promising potential for glioma analysis and clinical…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Fig. 1
Fig. 2
Fig. 3
Fig. 4|
Total n = 3,565 |
Training-internal validation set n = 2,551 |
Independent external test set n = 1,014 | |
|---|---|---|---|
| Age | 55.91 ± 14.79 | 57.22 ± 14.50 | 52.51 ± 15.00 |
| Sex | |||
| Female | 1,468 (41.18%) | 1,041 (40.81%) | 427 (42.11%) |
| Male | 2,016 (58.55) | 1,509 (59.15%) | 507 (50.00%) |
| Unknown | 81 (2.27%) | 1 (0.04%) | 80 (7.89%) |
| IDH status | |||
| Mutated | 836 (23.45%) | 498 (19.52%) | 338 (33.33%) |
| Wildtype | 2,026 (56.83%) | 1,558 (61.07%) | 468 (46.15%) |
| Unknown | 703 (19.72%) | 495 (19.40%) | 208 (20.51%) |
| 1p/19q codeletion status | |||
| Codeleted | 291 (8.16%) | 144 (5.64%) | 147 (14.50%) |
| Intact | 488 (13.69%) | 308 (12.07%) | 180 (17.75%) |
| Unknown | 2,786 (78.15%) | 2,099 (82.28%) | 687 (67.75%) |
| WHO Grade | |||
| WHO Grade 2 | 596 (16.72%) | 353 (13.84%) | 243 (23.96%) |
| WHO Grade 3 | 341 (9.57%) | 207 (8.11%) | 134 (13.21%) |
| WHO Grade 4 | 2,613 (73.30%) | 1,976 (77.46%) | 637 (62.82%) |
| Unknown | 15 (0.42%) | 15 (0.59%) | 0 (0.00%) |
| Histological diagnosis | |||
| Glioblastoma | 2,509 (70.38%) | 1,902 (75.46%) | 607 (59.86%) |
| Astrocytoma | 578 (16.21%) | 430 (16.86%) | 148 (14.60%) |
| Oligodendroglioma | 419 (11.75%) | 160 (6.27%) | 259 (25.54%) |
| Unknown | 59 (1.65%) | 59 (2.31%) | 0 (0.00%) |
| Molecular diagnosis | |||
| Glioblastoma, IDH-wildtype | 2,026 (56.83%) | 1,558 (61.07%) | 468 (46.15%) |
| Glioblastoma, NOS | 509 (14.28%) | 384 (15.05%) | 125 (12.33%) |
| Astrocytoma, IDH-mutated | 506 (14.19%) | 324 (12.70%) | 182 (17.95%) |
| Astrocytoma, NOS | 133 (3.73%) | 68 (2.67%) | 65 (6.41%) |
| Oligodendroglioma, IDH-mutated | 291 (8.16%) | 144 (5.64%) | 147 (14.50%) |
| Oligodendroglioma, NOS | 41 (1.15%) | 14 (0.55%) | 27 (2.66%) |
| Unknown | 59 (1.65%) | 59 (2.31%) | 0 (0.00%) |
|
Method | Lesion-wise Dice score | Lesion-wise HD95 | ||||||
|---|---|---|---|---|---|---|---|---|
| WT | TC | ET | Mean | WT | TC | ET | Mean | |
| Winner | 0.9101 | 0.8673 | 0.8509 | 0.8761 | 11.11 | 14.47 | 17.70 | 14.43 |
| Third place | 0.9063 | 0.8627 | 0.8432 | 0.8707 | 11.70 | 13.10 | 17.37 | 14.06 |
| OMT nnU-Net | 0.9082 | 0.8758 | 0.8566 | 0.8802 | 11.27 | 12.83 | 13.65 | 12.58 |
| Dataset | Model/reader | AUC (95% CI) | ACC | SENS | SPEC | PPV | NPV | F1-score |
|---|---|---|---|---|---|---|---|---|
|
Internal validation | MedTrans | 0.892 (0.862–0.922) | 0.922 | 0.953 | 0.832 | 0.944 | 0.856 | 0.948 |
| Effnet | 0.873 (0.841–0.906) | 0.908 | 0.943 | 0.804 | 0.934 | 0.827 | 0.939 | |
| PSPnet | 0.892 (0.862–0.922) | 0.913 | 0.934 | 0.850 | 0.949 | 0.812 | 0.941 | |
| SE-ResNet | 0.893 (0.851–0.934) | 0.920 | 0.975 | 0.811 | 0.910 | 0.943 | 0.941 | |
| OMT-APC | 0.917 (0.891–0.942) | 0.906 | 0.888 | 0.946 | 0.974 | 0.788 | 0.929 | |
|
External test | MedTrans | 0.767 (0.738–0.795) | 0.807 | 0.923 | 0.610 | 0.800 | 0.824 | 0.857 |
| Effnet | 0.758 (0.728–0.787) | 0.801 | 0.926 | 0.589 | 0.792 | 0.825 | 0.854 | |
| PSPnet | 0.757 (0.728–0.786) | 0.793 | 0.896 | 0.618 | 0.799 | 0.779 | 0.845 | |
| SE-ResNet | 0.771 (0.743–0.799) | 0.814 | 0.937 | 0.605 | 0.800 | 0.851 | 0.863 | |
| OMT-APC | 0.807 (0.780–0.833) | 0.830 | 0.900 | 0.714 | 0.841 | 0.808 | 0.869 | |
|
TCGA test | MedTrans | 0.771 (0.713–0.829) | 0.818 | 0.930 | 0.612 | 0.816 | 0.825 | 0.869 |
| Effnet | 0.759 (0.699–0.818) | 0.806 | 0.917 | 0.600 | 0.809 | 0.797 | 0.860 | |
| PSPnet | 0.771 (0.713–0.829) | 0.798 | 0.860 | 0.682 | 0.833 | 0.725 | 0.846 | |
| SE-ResNet | 0.783 (0.726–0.799) | 0.831 | 0.943 | 0.624 | 0.822 | 0.855 | 0.878 | |
| OMT-APC | 0.845 (0.799–0.893) | 0.855 | 0.879 | 0.812 | 0.896 | 0.784 | 0.887 | |
| Radgiologist1 | 0.760 (0.701–0.819) | 0.798 | 0.892 | 0.628 | 0.814 | 0.761 | 0.851 | |
| Radgiologist2 | 0.705 (0.640–0.771) | 0.745 | 0.841 | 0.570 | 0.781 | 0.662 | 0.810 | |
| Radgiologist3 | 0.698 (0.631–0.764) | 0.753 | 0.885 | 0.512 | 0.768 | 0.710 | 0.822 | |
| Radgiologist4 | 0.799 (0.745–0.834) | 0.819 | 0.866 | 0.733 | 0.855 | 0.750 | 0.861 |
| Dataset | Model/reader | AUC (95% CI) | ACC | SENS | SPEC | PPV | NPV | F1-score |
|---|---|---|---|---|---|---|---|---|
|
Internal validation | MedTrans | 0.867 (0.827–0.907) | 0.886 | 0.801 | 0.932 | 0.867 | 0.895 | 0.833 |
| Effnet | 0.878 (0.840–0.917) | 0.903 | 0.795 | 0.962 | 0.921 | 0.895 | 0.853 | |
| PSPnet | 0.884 (0.846–0.922) | 0.900 | 0.829 | 0.940 | 0.883 | 0.909 | 0.855 | |
| SE-ResNet | 0.892 (0.856–0.929) | 0.915 | 0.815 | 0.970 | 0.937 | 0.905 | 0.871 | |
| OMT-APC | 0.963 (0.941–0.985) | 0.968 | 0.945 | 0.981 | 0.965 | 0.970 | 0.955 | |
|
External test | MedTrans | 0.815 (0.772–0.858) | 0.855 | 0.734 | 0.896 | 0.707 | 0.908 | 0.720 |
| Effnet | 0.805 (0.761–0.848) | 0.852 | 0.709 | 0.900 | 0.709 | 0.900 | 0.709 | |
| PSPnet | 0.846 (0.806–0.886) | 0.869 | 0.797 | 0.894 | 0.720 | 0.928 | 0.757 | |
| SE-ResNet | 0.855 (0.816–0.894) | 0.895 | 0.772 | 0.937 | 0.808 | 0.923 | 0.789 | |
| OMT-APC | 0.879 (0.843–0.915) | 0.913 | 0.810 | 0.948 | 0.842 | 0.936 | 0.826 | |
|
TCGA test | MedTrans | 0.813 (0.752–0.873) | 0.826 | 0.742 | 0.884 | 0.815 | 0.832 | 0.776 |
| Effnet | 0.832 (0.774–0.890) | 0.844 | 0.764 | 0.899 | 0.840 | 0.847 | 0.800 | |
| PSPnet | 0.839 (0.782–0.896) | 0.849 | 0.787 | 0.891 | 0.833 | 0.858 | 0.809 | |
| SE-ResNet | 0.853 (0.813–0.883) | 0.867 | 0.775 | 0.930 | 0.885 | 0.857 | 0.826 | |
| OMT-APC | 0.908 (0.863–0.952) | 0.917 | 0.854 | 0.961 | 0.938 | 0.905 | 0.894 | |
| Radgiologist1 | 0.582 (0.504–0.660) | 0.633 | 0.303 | 0.860 | 0.600 | 0.642 | 0.403 | |
| Radgiologist2 | 0.764 (0.697–0.830) | 0.761 | 0.775 | 0.752 | 0.683 | 0.829 | 0.726 | |
| Radgiologist3 | 0.664 (0.590–0.739) | 0.702 | 0.461 | 0.868 | 0.707 | 0.700 | 0.558 | |
| Radgiologist4 | 0.771 (0.705–0.837) | 0.784 | 0.697 | 0.845 | 0.756 | 0.801 | 0.725 |
| Dataset | Model/reader | AUC (95% CI) | ACC | SENS | SPEC | PPV | NPV | F1-score |
|---|---|---|---|---|---|---|---|---|
|
Internal validation | MedTrans | 0.701 (0.588–0.831) | 0.789 | 0.500 | 0.919 | 0.737 | 0.803 | 0.596 |
| Effnet | 0.753 (0.638–0.869) | 0.822 | 0.571 | 0.935 | 0.800 | 0.829 | 0.667 | |
| PSPnet | 0.684 (0.560–0.808) | 0.767 | 0.464 | 0.903 | 0.684 | 0.789 | 0.553 | |
| SE-ResNet | 0.771 (0.671–0.898) | 0.844 | 0.571 | 0.968 | 0.889 | 0.833 | 0.695 | |
| OMT-APC | 0.873 (0.810–0.944) | 0.900 | 0.808 | 0.938 | 0.840 | 0.924 | 0.824 | |
|
External test | MedTrans | 0.493 (0.394–0.593) | 0.608 | 0.167 | 0.820 | 0.308 | 0.672 | 0.216 |
| Effnet | 0.525 (0.424–0.625) | 0.628 | 0.229 | 0.820 | 0.379 | 0.689 | 0.286 | |
| PSPnet | 0.514 (0.414–0.614) | 0.622 | 0.208 | 0.820 | 0.357 | 0.683 | 0.263 | |
| SE-ResNet | 0.581 (0.481–0.681) | 0.682 | 0.292 | 0.870 | 0.519 | 0.719 | 0.373 | |
| OMT-APC | 0.681 (0.585–0.776) | 0.730 | 0.542 | 0.820 | 0.591 | 0.788 | 0.565 | |
|
TCGA test | MedTrans | 0.565 (0.433–0.697) | 0.685 | 0.259 | 0.871 | 0.467 | 0.730 | 0.333 |
| Effnet | 0.621 (0.490–0.751) | 0.719 | 0.370 | 0.871 | 0.556 | 0.761 | 0.444 | |
| PSPnet | 0.602 (0.471–0.733) | 0.708 | 0.333 | 0.871 | 0.529 | 0.750 | 0.409 | |
| SE-ResNet | 0.668 (0.518–0.776) | 0.742 | 0.481 | 0.855 | 0.591 | 0.791 | 0.530 | |
| OMT-APC | 0.769 (0.653–0.884) | 0.809 | 0.667 | 0.871 | 0.692 | 0.857 | 0.679 | |
| Radgiologist1 | 0.478 (0.346–0.609) | 0.517 | 0.385 | 0.571 | 0.270 | 0.692 | 0.317 | |
| Radgiologist2 | 0.543 (0.409–0.676) | 0.640 | 0.308 | 0.778 | 0.364 | 0.731 | 0.333 | |
| Radgiologist3 | 0.446 (0.317–0.576) | 0.472 | 0.385 | 0.508 | 0.244 | 0.667 | 0.299 | |
| Radgiologist4 | 0.443 (0.313–0.572) | 0.371 | 0.615 | 0.270 | 0.258 | 0.630 | 0.364 |
- —National Science and Technology Innovation 2030
- —MOST | National Natural Science Foundation of China (NSFC)501100001809
- —MOST | National Natural Science Foundation of China (NSFC)501100001809
- —MOST | National Natural Science Foundation of China (NSFC)501100001809
- —Jiangsu Provincial Scientific Research Center of Applied Mathematics
- —Shanghai Institute for Mathematics and Interdisciplinary Sciences
- —National Science and Technology Council (NSTC)100020595
- —Fundings for Clinical Trials from the Affiliated Drum Tower Hospital, Medical School of Nanjing University
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification
Glioma is the most common primary malignant tumor in adults (1). The response to treatment and survival outcomes are influenced by the histological and genetic characteristics of gliomas, particularly tumor grade, isocitrate dehydrogenase (IDH) mutation status, and 1p/19q codeletion (2, 3). In 2021, the World Health Organization (WHO) 5th edition classification of central nervous system (CNS) tumors classified adult diffuse gliomas into three subgroups based on IDH mutation and 1p/19q codeletion status: glioblastoma, IDH-wildtype; astrocytoma, IDH-mutant, and 1p/19q-intact; and oligodendroglioma, IDH-mutant, and 1p/19q-codeleted (4). Numerous studies have shown that IDH-mutant gliomas are less aggressive and associated with a better prognosis than IDH-wildtype glioblastomas. Low-grade gliomas typically exhibit better treatment responses and more prolonged survival than high-grade gliomas. Therefore, accurately predicting the WHO grade, IDH mutation status, and 1p/19q-codeletion status of gliomas is crucial for informing prognosis and guiding treatment decisions (5).
Currently, histological and genetic information is obtained through the analysis of tumor tissue from surgery or biopsy, a costly and time-consuming process. For patients whose tumors cannot be safely resected, tumor samples may not be available, precluding further genetic analysis. As a result, there is an increasing demand for noninvasive methods that can provide crucial histological and genetic information about gliomas. MRI has become a key tool in preoperative diagnosis and evaluation of brain tumors (6?–8). MRI can reveal glioma morphology and imaging heterogeneity within the tumor, which correlates with histological and genetic characteristics (9). With rapid advancements in computational technology, radiomics and deep learning techniques have gained widespread use in clinical decision-making systems, enhancing diagnosis and patient care across various clinical settings (10?–12). Several pilot studies have explored the potential of deep learning models to predict genetic mutation status (13?–15). However, these studies have been limited by small sample sizes (often fewer than 1,000 patients) and typically focus on predicting a single genetic feature at a time.
This study utilized a deep learning approach to predict WHO grade, IDH mutation status, and 1p/19q codeletion status based on tumor regions identified in preoperative MRI data. We incorporated preoperative two-dimensional (2D) or three-dimensional (3D) T_1_-weighted contrast-enhanced (T_1_CE) and T_2_-weighted imaging (T_2_WI) MRI data from 3,565 glioma patients across 16 datasets. However, the dataset presented challenges due to the mixture of 2D and 3D data and the lack of tumor labels in most cases, increasing the complexity of classification. We proposed the optimal mass transport (OMT) method to normalize multicenter datasets and segment the tumor region using deep learning to address these issues (16, 17). Unlike traditional techniques for brain tumor classification and segmentation, the OMT method achieves comprehensive normalization across multicenter datasets, offering superior generalization, robustness, and interpretability, making it suitable for integration with deep learning models.
A key technical challenge in deep learning for tumor region segmentation is the GPU memory limitation caused by the large size of MRI brain images. Traditional approaches, such as random cropping of raw brain images, often risk omitting critical information. We introduced the OMT technique to preserve the global structure of MRI data. OMT transforms MRI brain images into m × m × m tensors with minimal distortion, maintaining the overall 3D structure of the MRI data. The OMT density function enhances the tumor region while preserving the volume of nontumor regions within the OMT tensor, thereby improving segmentation performance.
This study utilized deep learning on OMT tumor regions to predict WHO grade, IDH mutation status, and 1p/19q codeletion status. We developed an algebraic preclassification (APC) model that leverages tumor region tensors embedded within OMT tensors and employs multimode tensor SVD for processing testing data. The multimode SVD offers an efficient and reliable algebraic preclassification model for probability estimation. By integrating the OMT technique with the APC model (OMT-APC) into a SE-ResNet model, we significantly improved the performance of the deep learning architecture. In contrast to traditional methods that focus on extracting radiomics features, statistical characteristics, or other information from tumor regions, the OMT-APC framework provides a more efficient and robust method for classification tasks.
Result
Baseline Patients’ Characteristics.
We enrolled a total of 3,565 patients in our study. The patients in the training set originated from 5 different datasets, and the test set data were collected from 11 different datasets, respectively. Table 1 illustrates a full overview of the baseline characteristics in the training and test set and Fig. 1 provides the inclusion–exclusion flowchart and the distribution of the patients from different datasets in the training set and test set.
Dataset distribution and patient enrollment flowchart.
Tumor Segmentation.
The BraTS 2023 Adult Glioma Challenge dataset (18?–20) includes 1,251 labeled training brain images and 219 unlabeled brain images for online validation. We evaluated the performance of our OMT-based nnU-Net model for the whole tumor (WT), tumor core (TC), and enhanced tumor (ET) regions, comparing it with the top-performing models from the BraTS 2023 challenge. Specifically, we benchmarked our results against the winning model (21) and the third-place model (22), as summarized in Table 2. Notably, while the winner team employed generative adversarial networks and registration techniques to create an extensive dataset of 23,049 training samples, our approach—utilizing only 1,251 training samples with four density parameters—consistently delivered results that either outperformed or matched those researches.
Model Capacity to Predict WHO Grade of Adult Diffuse Gliomas.
On internal validation set, OMT-APC model achieved AUC of 0.917 and ACC of 0.906, and SE-ResNet model achieved AUC of 0.893 and ACC of 0.920. On the External test set, OMT-APC model achieved AUC of 0.807 and ACC of 0.830, and SE-ResNet model achieved AUC of 0.771 and ACC of 0.814. On TCGA test set, OMT-APC model achieved AUC of 0.845 and ACC of 0.855, and SE-ResNet model achieved AUC of 0.783 and ACC of 0.831. As shown in Table 3 and Fig. 2, we can observe that SE-ResNet outperformed MedTrans, Effnet, and PSPnet, and our APC method can enhance the performance of the SE-ResNet model in the WHO grade classification task. Through Fig. 3 and Table 3, we can observe that our method outperformed 4 radiologists.
(A) OMT process of brain images. (B) Flowchart of the study design.
Radar maps comparing the performance of SE-ResNet Model, OMT-APC Model, and Radiologists in predicting WHO Grade, IDH mutation, and 1p/19q codeletion status.
Model Capacity to Predict IDH Mutation Status of Adult Diffuse Gliomas.
On internal validation set, the OMT-APC model achieved an AUC of 0.963 and ACC of 0.968, and the SE-ResNet model achieved an AUC of 0.892 and ACC of 0.915. On External test set, the OMT-APC model achieved an AUC of 0.879 and ACC of 0.913, and the SE-ResNet model achieved an AUC of 0.855 and ACC of 0.895. On TCGA test set, the OMT-APC model achieved an AUC of 0.908 and ACC of 0.917, and the SE-ResNet model achieved an AUC of 0.853 and ACC of 0.867. As shown in Table 4 and Fig. 2, we can observe that SE-ResNet outperformed MedTrans, Effnet, and PSPnet and our APC method can significantly enhance the performance of the SE-ResNet model in the IDH mutation status classification task. Through Fig. 3 and Table 4, we can observe that our method outperformed 4 radiologists.
Model Capacity to Predict 1p/19q Codeletion Status of Adult Diffuse Gliomas.
On internal validation set, the OMT-APC model achieved an AUC of 0.873 and ACC of 0.9, and the SE-ResNet model achieved AUC of 0.771 and ACC of 0.844. On External test set, OMT-APC model achieved AUC of 0.681 and ACC of 0.73, and SE-ResNet model achieved AUC of 0.581 and ACC of 0.682. On TCGA test set, the OMT-APC model achieved AUC of 0.769 and ACC of 0.809, and the SE-ResNet model achieved an AUC of 0.668 and ACC of 0.742. As shown in Table 5 and Fig. 2, we can observe that SE-ResNet outperformed MedTrans, Effnet, and PSPnet and our APC method can significantly enhance the performance of the SE-ResNet model in the 1p/19q codeletion status classification task. Through Fig. 3 and Table 5, we can observe that our method outperformed 4 radiologists.
Discussion
This study developed a deep learning OMT-APC model that integrates OMT and multimode tensor SVD to predict the WHO grade, IDH mutation, and 1p/19q codeletion status of adult diffuse gliomas using preoperative MR images.
The OMT method preserved the global structure of 3D MRI brain images while enlarging the tumor region within the OMT tensor. By varying parameters in the density function, OMT facilitated data augmentation and ensured reliable tumor segmentation (16, 17). By weighting the tumor region through the density function, we are able to amplify the spatial proportion of the tumor region on the tensor image, successfully enhancing the segmentation accuracy of TC and ET. In classification tasks, amplifying the tumor region also facilitates information extraction, helping the AI model deeply mine molecular genetic information in the images. Additionally, the OMT method standardized on multicenter imaging datasets can stably eliminate differences between data from different centers and exhibits strong robustness.
The truncated SVDs of the s-mode unfolding matrices (s = 1, 2, 3) of the OMT tensors, coupled with a multiview orthonormalized partial least squares optimization approach, provided an algebraic preclassification model. This model estimated the probabilities of preclassifications for testing data, enhancing classification accuracy. The APC model eliminated the need for neural network pretraining, providing preclassification data for deep learning tasks while significantly reducing computational time.
We trained our OMT-APC model on MRI data from 2,551 glioma patients across five centers. When tested on the publicly available TCGA dataset, the model achieved an AUC of 0.845 for WHO grade prediction, 0.908 for IDH mutation status prediction, and 0.769 for 1p/19q codeletion status prediction. Choi developed a deep convolutional neural network (CNN)–based deep learning model with 1166 glioma patients’ images and obtained an AUC of 0.86 on TCGA test set (23). Van der Voort developed another CNN model using a patient cohort of 1508 glioma patients and also tested on TCGA set, achieving an AUC of 0.81, 0.90, and 0.85 for WHO Grade, IDH mutation, and 1p/19q codeletion task, respectively (24). Our method outperformed previous research and obtained higher performance in multiclassification tasks on the same TCGA test set. This OMT-APC model enabled automated tumor segmentation and multitask genetic classification across diverse populations, MRI machine vendors, and imaging protocols, outperforming radiologists and demonstrating promising potential for broad clinical application.
The WHO CNS5 classification emphasizes the critical role of molecular diagnosis in adult diffuse gliomas (25). IDH mutation and 1p/19q codeletion status are key factors in glioma classification (26). IDH mutations are associated with a more favorable prognosis and better treatment response, regardless of histological presentation (27). Additionally, 1p/19q codeletion distinguishes oligodendroglioma from astrocytoma in IDH-mutated glioma patients (28). Identifying these molecular markers is essential for therapeutic decisions and clinical management (29).
Our OMT-APC model is particularly beneficial for patients where tumor resection is unsafe and burr-hole biopsy is performed solely for diagnostic purposes. In these cases, the model provides a noninvasive alternative. Furthermore, in situations where sample bias creates diagnostic uncertainty, the model can serve as an additional check for histological results.
Comparing with previous methods for the segmentation and classification of brain tumors, the OMT method can achieve complete normalization on multicenter datasets, demonstrating that generalization, robustness, and visibility which is suitable for the CNN model. The multimode SVD provides an efficient algebraic preclassification model for the probability estimation which can be used to significantly enhance classification accuracy. Moreover, our approach relies solely on 2D or 3D T_1_CE and T_2_WI images for segmentation and classification tasks without incorporating advanced modalities such as diffusion-weighted imaging, perfusion-weighted imaging, or susceptibility-weighted imaging. This highlights the strong generalizability of the OMT-APC method in medical imaging applications.
Future improvements will focus on enhancing the accuracy of the OMT method in segmenting the WT region. The research will focus on identifying the optimal density function within OMT by leveraging machine learning to maximize the extraction of information from suspected tumor regions in MR images, thereby enhancing edge recognition of the WT region. The OMT-APC approach incorporates both geometric transformations and algebraic features of MR images, enabling effective tumor segmentation and classification. Consequently, future efforts will also aim to develop advanced algebraic and geometric techniques to extract feature information from other medical imaging modalities, supporting image-based gene classification, medical diagnosis, and treatment planning.
Materials and Methods
Data Collection.
We enrolled a total of 3565 patients in our study and conducted a retrospective analysis by gathering preoperative contrast-enhanced T_1_CE and T_2_WI from adult diffuse gliomas patients undergoing surgery or biopsy. This research was approved by the Medical Ethical committee and the Institutional Review Board of Nanjing Drum Tower Hospital (2022-364-02) and registered on Clinical Trial (NCT05624736). The written informed consent was waived due to the retrospective nature of this study.
Four private datasets from China between January 2018 and December 2023 included Nanjing Drum Tower Hospital (NJDTH), Jinling Hospital (JLH), Nanjing Brain Hospital (NBH), and LuAn People’s Hospital (LAH). Two publicly available datasets from Europe included the Erasmus Glioma database (EGD) dataset (30) and the LUMIERE dataset (31). Ten publicly available datasets from America included the University of Pennsylvania glioblastoma (UPenn-GBM) dataset (32), the University of California San Francisco Preoperative Diffuse Glioma MRI Dataset (UCSF-PDGM) dataset (33), the ReMIND dataset (34), the Ivy Glioblastoma Atlas (Ivy-GAP) dataset (35), the Rembrandt brain cancer dataset (REMBRANDT) (36), the National Cancer Institute Clinical Proteomic Tumor Analysis Consortium Glioblastoma Multiforme (CPTAC-GBM) dataset, the Low grade glioma 1p19qDeletion (1p19q-LGG) dataset (37), the BraTS2020 dataset, The Cancer Genome Atlas Glioblastoma Multiforme (TCGA-GBM) dataset, and the Cancer Genome Atlas Low Grade Glioma (TCGA-LGG) dataset (18). The cases in the BraTS2020 that overlap with TCGA-GBM and TCGA-LGG were excluded. NJDTH, EGD, Upenn-GBM, UCSF-PDGM and ReMIND were designated as training and internal validation set. JLH, NBH, LAH, LUMIERE, Ivy-GAP, REMBRANDT, CPTAC-GBM, 1p19q-LGG, BraTS2020, TCGA-GBM, and TCGA-LGG were designated as external validation set.
Our inclusion criteria were as follows: i) newly diagnosed and pathologically confirmed glioma. ii) available T_1_CE and T_2_WI images within 2 wk before surgery or biopsy. iii) Known WHO grade, IDH mutation or 1p/19q codeletion status. Exclusion criteria included i) lacking preoperative images; ii) lacking T_1_CE or T_2_WI sequence. Therefore, a total of 2551 patients were included in the training and internal validation set with a ratio of 4:1, and 1014 patients were included in the external test set.
Image Processing and OMT.
The image data used in this study were sourced from a 16-center dataset and underwent various preprocessing methods due to the lack of raw MRI data for standardized processing. The raw images were converted to standardized skull-stripped brain images through the following steps:
- 1.Resampling: Both 2D and 3D T_2_WI and T_1_CE were resampled to a standard 3D voxel space with a resolution of .
- 2.Intensity Normalization: Image grayscale values were normalized to the range [0, 1,000].
- 3.Bias Field Correction: Bias field distortions were corrected.
- 4.Image Registration: T_2_WI was used as the reference image, and T_1_CE images were rigidly coregistered to the T_2_WI modality (38, 39).
- 5.Brain Extraction: A combination of BET and neural network models was applied to extract the brain region, removing the skull, eyes, nose, and neck to mitigate their impact on brain image analysis (40).
Finally, the preprocessed images were further analyzed using OMT techniques.
Consider a discrete simplicial manifold with a tetrahedral mesh representation of the above 3D brain image. This manifold comprises sets of vertices and tetrahedrons . Let represent the normalized grayscale values derived from contrast-enhanced histogram equalization of the brain image, and let denote the grayscale value at vertex . Denote WT region as . A density function at is defined as follows:
where is a hyperparameter. We further define at tetrahedron and the local volume measure at vertex by and , where is the set of 1-ring neighboring tetrahedrons of , and represents the volume of tetrahedron .
Using , a mass-preserving discrete OMT map is proposed, as detailed in (17, 41, 42), to transform into a discrete unit sphere . This map is defined by
where the set of feasible maps is
Similarly, the discrete solid cube , consisting of vertices and tetrahedrons , can be transformed into using volume-preserving OMT map :
where
By composing these maps, enables the transformation of the irregular domain into the solid cube , preserving mass. The resulting 3D OMT tensor derived from can serve as input data for deep learning models.
As illustrated in Fig. 4, the OMT map transforms the irregular brain image into an tensor with minimal distortion, preserving the global structure of the 3D MR image. The density function defined in Eq. 1 enhances the tumor region while maintaining the volume of the nontumor region in the OMT tensor. Furthermore, varying the parameter allows for data augmentation, increasing diversity and mitigating overfitting during model training.
ROC curves for OMT-APC model and Radiologists in predicting WHO Grade, IDH mutation, and 1p/19q codeletion status.
Automated Tumor Segmentation.
In this study, the fully automated nnU-Net (43) was utilized for model training, incorporating fivefold cross-validation for each network. This updated version of nnU-Net includes a comprehensive preprocessing and data augmentation pipeline. To further enhance the training data, we augment it from to using OMT tensors with values of . To accommodate this increase in data, the network size is adjusted asymmetrically, doubling the number of filters in the encoder while maintaining the number of filters in the decoder. The maximum number of filters is also increased to 512. Additionally, all instance normalization layers are replaced with group normalization layers, setting the number of groups to 32.
Algebraic Preclassification Model.
Let for represent the OMT image tensors with of 1.75 in the training data. Let denote the associated label vector, which can be either or . Define and as the number of instances where and , respectively, so that .
For each tensor , we extract a smaller tensor corresponding to the WT region. Using these tumor region tensors for , we propose a preclassification model that estimates the probability function for the testing data. These estimated probabilities can then be incorporated into the training model to improve classification accuracy.
Define the s-mode unfolding matrix of as for . We approximate using the truncated SVD:
where with . We then vectorize to obtain , where . We define the matrices with and , where with , if ; , if . This weighting matrix balances the contributions of the two label classes during the optimization in Eq. 2 below.
We then formulate a multiview orthonormalized partial least squares (MvOPLS) optimization problem, incorporating a Tikhonov regularize:
Let and As the derivations in (44), the optimizer of Eq. 2 can be found by solving the generalized eigenvalue problem:
where consists of the normalized eigenvectors of the matrix pair corresponding to the largest eigenvalues. Once is computed, we can obtain the optimizer for Eq. 2:
For the testing data, we have a 3D OMT image tensor . We use an OMT-based nnU-Net model to segment the tumor region in , and then crop the corresponding tensor . Using the pretrained MvOPLS model, we compute the probability approximations for the preclassification of :
where is the vectorization of the truncated SVD of the s-mode unfolding matrix of . For each patient, this results in six algebraic probability preclassifications based on the two modalities (T_1_CE and T_2_WI). The Tikhonov regularization parameters are typically set to .
SE-ResNet with Algebraic Preclassification (APC-SE-ResNet).
The middle and right sections of Fig. 4B depict the SE-ResNet architecture(45) integrated with a pretrained MvOPLS model for the classification tasks of predicting the IDH status, 1p/19q codeletion, and WHO grade. The top-middle blue box shows that the SE-ResNet architecture uses OMT tensors from T_1_CE, T_2_WI, and tumor labels as inputs. In the bottom-middle blue box, the architecture concatenates the probability vectors generated by the MvOPLS preclassification model with the output vector from the fully connected (FC) layer. This concatenated vector is subsequently passed onto the next FC layer for further processing.
Due to the class imbalance in the IDH, 1p/19q, and WHO grade tasks, we use cross-entropy loss as the training loss function. To address the imbalance issue, we also apply label-smoothing regularization to adjust the ground-truth distribution:
where is the label-smoothing parameter and is the number of classes. For our classification tasks, we set .
We adopt the AdaMax optimizer, a variant of Adam using the infinity norm. The initial learning rate is and decreases by a factor of 0.96 every 10 epochs. The model is trained for up to 500 epochs.
Fig. 4 shows the entire workflow of our proposed method, the OMT-APC model, for predicting WHO grade, IDH mutation, and 1p/19q codeletion. The OMT-APC model process includes the following steps:
- 1.Image Preprocessing: Extracting brain images from raw data.
- 2.Tumor Segmentation: Using an OMT-based nnU-Net model to segment tumor regions in the OMT tensors of the extracting brain images.
- 3.Preclassification: Applying a pretrained MvOPLS model to generate algebraic preclassifications of the testing OMT tensors.
- 4.Model Training: Training the OMT-APC model using both the training OMT tensors and the algebraic preclassifications of the test data.
For the WHO grade classification, low-grade gliomas (WHO grade 2 and 3) are considered the positive class, while high-grade gliomas (WHO grade 4) are the negative class. For IDH mutation status classification, IDH-mutated samples are treated as the positive class. Similarly, for 1p/19q codeletion status classification, 1p/19q codeleted samples are regarded as the positive class. The TCGA-GBM and TCGA-LGG datasets serve as benchmarks for evaluating the classification model.
Model Evaluation and Human–AI competing Test.
For comparison, we chose the following three deep learning models widely utilized in medical imaging: MedTrans (46), Effnet (47), and PSPnet (48).
The performance of the deep learning model was evaluated by assessing the AUC of the receiver operative characteristic curve (ROC), Accuracy (ACC), Sensitivity (SENS), Specificity (SPEC), Positive Predictive Value (PPV), Negative Predictive Value (NPV), and F1-score.
TCGA-GBM and TCGA-LGG datasets were designated as test set for human–AI competing because these two datasets were most widely used in machine learning–based Glioma imaging genotyping research. Four board-certificated neuroradiologists, including two junior neuroradiologists with 3-y-experience in glioma imaging and two senior neuroradiologist with 8 y of experience, independently read T_1_CE and T_2_WI images and predicted the WHO grade, IDH mutation, and 1p/19q codeletion status of patients. The radiologists were blinded to clinical characteristics and pathological results during the evaluation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1M. Price , CBTRUS statistical report: American Brain Tumor Association & NCI neuro-oncology branch adolescent and young adult primary brain and other central nervous system tumors diagnosed in the United States in 2016–2020. Neuro. Oncol. 26, iii 1–iii 53 (2024).38709657 10.1093/neuonc/noae 047PMC 11073545 · doi ↗ · pubmed ↗
- 2H. Pinson , Epidemiology and survival of adult-type diffuse glioma in Belgium during the molecular era. Neuro Oncol. 26, 191–202 (2024).37651614 10.1093/neuonc/noad 158PMC 10768998 · doi ↗ · pubmed ↗
- 3T. Nakase , Genome-wide polygenic risk scores predict risk of glioma and molecular subtypes. Neuro-oncology 26, 1933–1944 (2024).38916140 10.1093/neuonc/noae 112PMC 11448969 · doi ↗ · pubmed ↗
- 4D. N. Louis , The 2021 WHO classification of tumors of the central nervous system: A summary. Neuro-Oncol 23, 1231–1251 (2021).34185076 10.1093/neuonc/noab 106PMC 8328013 · doi ↗ · pubmed ↗
- 5T. Jiang , Clinical practice guidelines for the management of adult diffuse gliomas. Cancer Lett. 499, 60–72 (2021).33166616 10.1016/j.canlet.2020.10.050 · doi ↗ · pubmed ↗
- 6J. Zhou , Review of tracer kinetic models in evaluation of gliomas using dynamic contrast-enhanced imaging. Front. Oncol. 14, 1380793 (2024).38947892 10.3389/fonc.2024.1380793 PMC 11211364 · doi ↗ · pubmed ↗
- 7S. S. Ahn , Identification of magnetic resonance imaging features for the prediction of molecular profiles of newly diagnosed glioblastoma. J. Neurooncol. 154, 83–92 (2021).34191225 10.1007/s 11060-021-03801-y · doi ↗ · pubmed ↗
- 8K. M. Kang , MRI scoring systems for predicting isocitrate dehydrogenase mutation and chromosome 1p/19q codeletion in adult-type diffuse glioma lacking contrast enhancement. Radiology 311, e 233120 (2024).38713025 10.1148/radiol.233120 · doi ↗ · pubmed ↗
