# Classification of pediatric dental diseases from panoramic radiographs using natural language transformer and deep learning models

**Authors:** Tuan D. Pham, Seba Al-Hebshi

PMC · DOI: 10.3389/frai.2026.1754498 · Frontiers in Artificial Intelligence · 2026-03-03

## TL;DR

This paper proposes using text-based deep learning models to classify pediatric dental diseases from X-rays, showing that text-driven approaches can outperform traditional image-based models.

## Contribution

The novel contribution is a text-driven framework using natural language transformers to generate descriptions of dental radiographs for disease classification.

## Key findings

- A 1D-CNN achieved 84% accuracy in classifying pediatric dental diseases from text descriptions.
- BERT showed strong performance in detecting periapical infections but struggled with caries identification.
- Text-based models outperformed image-based CNNs, suggesting text-driven approaches are viable for dental disease classification.

## Abstract

Accurate classification of pediatric dental diseases from panoramic radiographs is essential for early diagnosis and effective treatment planning. While deep learning models traditionally operate directly on image data, text-based representations generated from radiographs may provide an alternative strategy for disease classification.

This study proposed a text-driven framework in which a natural language transformer was used to generate structured textual descriptions from panoramic radiographs. These descriptions were subsequently classified for binary disease detection using three deep learning architectures: a one-dimensional convolutional neural network (1D-CNN), a long short-term memory (LSTM) network, and a pretrained Bidirectional Encoder Representations from Transformer (BERT) model. Model performance was evaluated and compared against three pretrained convolutional neural networks trained directly on radiographic images.

The 1D-CNN achieved the highest performance with 84% accuracy, demonstrating balanced classification across disease categories. The BERT model reached 77% accuracy, showing strong performance in detecting periapical infections but comparatively lower sensitivity for caries identification. The LSTM model performed substantially worse, achieving 57% accuracy. Both the 1D-CNN and BERT text-based approaches outperformed the three image-based pretrained CNN models.

These findings suggest that text-based classification of panoramic radiographs is a potential alternative to conventional image-based deep learning methods. Language-driven models show promise for radiographic interpretation; however, challenges remain in achieving consistent generalizability across disease types. Future research should focus on improving radiograph-to-text generation quality, developing hybrid architectures that integrate textual and visual features, and validating performance on larger and more diverse datasets to strengthen clinical applicability.

## Full-text entities

- **Diseases:** dental diseases (MESH:D009057), periapical infections (MESH:D010483), caries (MESH:D003731)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13040369/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13040369/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC13040369/full.md

---
Source: https://tomesphere.com/paper/PMC13040369