# Transformer-Based Deep Learning for Preoperative Prediction of Microvascular Invasion in Hepatocellular Carcinoma

**Authors:** Ruilin He, Huilin Chen, Wenjie Zou, Mengting Gu, Xingyu Zhao, Ningyang Jia, Wanmin Liu

PMC · DOI: 10.3390/cancers17203314 · Cancers · 2025-10-14

## TL;DR

A deep learning model using MRI and clinical data accurately predicts microvascular invasion in liver cancer, helping guide personalized treatment.

## Contribution

A Transformer-based model for three-class MVI prediction in HCC using radiomics and clinical features.

## Key findings

- The model achieved 0.733 accuracy and 0.880 AUC on the internal test set.
- It maintained 0.758 accuracy and 0.886 AUC on an external validation set.
- High sensitivity (0.73) and specificity (0.91) for high-risk M2 cases were observed.

## Abstract

Microvascular invasion (MVI) is a key predictor of recurrence in hepatocellular carcinoma (HCC). We developed a Transformer-based deep learning model to noninvasively predict three MVI grades (M0, M1, and M2) using preoperative MRI radiomics and clinical biomarkers. The model achieved high accuracy across independent datasets. Notably, reliable identification of high-risk M2 patients before surgery could help optimize resection margins, guide adjuvant therapy, and support individualized treatment planning.

Background: Microvascular invasion (MVI) is a critical prognostic factor in hepatocellular carcinoma (HCC), but preoperative three-class prediction remains challenging. Radiomics and clinical biomarkers may enable more accurate and individualized assessment. Aim: The aim of this study was to develop and validate a Transformer-based deep learning framework that integrates radiomic and clinical features for direct three-class MVI classification in HCC patients. Methods: This retrospective study included 437 patients with pathologically confirmed hepatocellular carcinoma (HCC) and microvascular invasion (MVI) status from two campuses of a single institution. Patients from Hospital A (n = 305) were randomly divided into training and internal test cohorts, while patients from Hospital B (n = 132) were used as an independent external validation cohort. Radiomic features were extracted from preoperative Gd-BOPTA-enhanced MRI, and clinical laboratory data were collected. A two-stage feature selection strategy, combining univariate statistical testing and recursive feature elimination, was applied. A Transformer-based model was built to classify three MVI categories (M0, M1, M2), and its performance was evaluated in both the internal test cohort and the external validation cohort. Results were compared with those from traditional machine learning models, including Random Forest, Logistic Regression, XGBoost, and LightGBM. Results: On the internal test set (n = 76, Hospital A), the model achieved an accuracy of 0.733 (95% CI: 0.64–0.83), a weighted F1-score of 0.733, and a macro-average AUC of 0.880 (95% CI: 0.807–0.953). The sensitivity and specificity for M1 were 0.56 (95% CI: 0.31–0.78) and 0.86 (95% CI: 0.74–0.94), respectively; for high-risk M2 cases, the sensitivity was 0.73 (95% CI: 0.64–0.81) and the specificity was 0.91 (95% CI: 0.85–0.96). On the external validation set (n = 132, Hospital B), performance remained stable with an accuracy of 0.758, a weighted F1-score of 0.768, and a macro-average AUC of 0.886 (95% CI: 0.833–0.940). Conclusions: This Transformer-based model enables accurate and objective three-class MVI prediction using multi-modal features, supporting individualized surgical planning and improved clinical outcomes. In particular, the ability to preoperatively identify high-risk M2 patients may inform surgical margin design, guide adjuvant therapy strategies, and influence liver transplantation eligibility.

## Linked entities

- **Chemicals:** Gd-BOPTA (PubChem CID 9874536)
- **Diseases:** hepatocellular carcinoma (MONDO:0007256)

## Full-text entities

- **Diseases:** HCC (MESH:D006528)
- **Chemicals:** Gd-BOPTA (MESH:C064572)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12564389/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12564389/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/PMC12564389/full.md

---
Source: https://tomesphere.com/paper/PMC12564389