# Oral squamous cell carcinoma grading classification using deep transformer encoder assisted dilated convolution with global attention

**Authors:** Singaraju Ramya, R. I. Minu

PMC · DOI: 10.3389/frai.2025.1575427 · 2025-10-17

## TL;DR

This paper introduces a new deep learning model for classifying oral cancer grades with high accuracy using advanced techniques like transformers and data augmentation.

## Contribution

The novel DeTr-DiGAtt model improves OSCC classification accuracy and efficiency using a transformer encoder and global attention mechanisms.

## Key findings

- The proposed model achieves 98.59% accuracy in OSCC grading classification.
- Segmentation results show a Dice score of 97.97% and IoU of 98.08%.
- Ad-GreLop hyperparameter tuning enhances model performance.

## Abstract

In recent years, Oral Squamous Cell Carcinoma (OSCC) has been a common tumor in the orofacial region, affecting areas such as the teeth, jaw, and temporomandibular joint. OSCC is classified into three grades: “well-differentiated, moderately differentiated, and poorly differentiated,” with a high morbidity and mortality rate among patients. Several existing methods, such as AlexNet, CNN, U-Net, and V-Net, have been used for OSCC classification. However, these methods face limitations, including low ACC, poor comparability, insufficient data collection, and prolonged training times. To address these limitations, we introduce a novel Deep Transformer Encoder-Assisted Dilated Convolution with Global Attention (DeTr-DiGAtt) model for OSCC classification. To enhance the dataset and mitigate over-fitting, a GAN model is employed for data augmentation. Additionally, an Adaptive Bilateral Filter (Ad-BF) is used to improve image quality and remove undesirable noise. For accurate identification of the affected region, an Improved Multi-Encoder Residual Squeeze U-Net (Imp-MuRs-Unet) model is utilized for segmentation. The DeTr-DiGAtt model is then applied to classify different OSCC grading levels. Furthermore, an Adaptive Grey Lag Goose Optimization Algorithm (Ad-GreLop) is used for hyperparameter tuning. The proposed method achieves an accuracy (ACC) of 98.59%, a Dice score of 97.97%, and an Intersection over Union (IoU) of 98.08%.

## Linked entities

- **Diseases:** Oral Squamous Cell Carcinoma (MONDO:0004958)

## Full-text entities

- **Diseases:** tumor (MESH:D009369), OSCC (MESH:D000077195)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

22 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12575215/full.md

---
Source: https://tomesphere.com/paper/PMC12575215