# Development and evaluation of an AI model for dental implant type detection: A comparison of diagnostic accuracy between a deep learning model and dental professionals

**Authors:** Walaa Magdy Ahmed, Amr Ahmed Azhari, Abdulrahman Almufti, Zainab Majed Alsadah, Amr Fawzy Abdelhamid Ahmed, Anas Lahiq, Khaled Ahmed Fawaz

PMC · DOI: 10.1111/jopr.70064 · Journal of Prosthodontics · 2025-11-26

## TL;DR

A deep learning model was developed to identify dental implant brands from X-rays and outperformed dental professionals in accuracy.

## Contribution

The study introduces a high-accuracy AI model for dental implant detection and compares it with clinicians and different YOLO versions.

## Key findings

- The YOLOv12x model achieved 98.9% mAP@50 and 90.0% mAP@50–95 in implant brand detection.
- The model outperformed all clinician groups across experience levels (p < 0.001).
- YOLOv12x showed better accuracy and speed than earlier YOLO versions and a transformer-based model.

## Abstract

To develop a deep‐learning system for identifying five dental implant brands from periapical radiographs and compare its diagnostic accuracy with dental professionals and evaluate successive You Only Look Once (YOLO) architectures (v7–v12) to justify model selection.

A dataset of 5851 periapical radiographs was compiled and divided into training, validation, and test partitions (80/10/10). After filtering to five brands (Adin, Dentium, Noris, OSSTEM, and Straumann), YOLO‐based object detection models (versions 7–12) were trained and tested under identical conditions. The YOLOv12x model was adopted for final evaluation based on its optimal balance of accuracy and inference speed. Human performance was assessed using 100 held‐out test images (20 per brand) via a multiple‐choice web survey distributed to six clinician groups. Diagnostic metrics included mean average precision at IoU 0.50 (mAP@50), mAP@50–95, precision, recall, and accuracy.

The model achieved an mAP@50 of 0.989 (98.9%), an mAP@50–95 of 0.900 (90.0%), a precision of 0.969 (96.9%), and a recall of 0.977 (97.7%) across brands. Across YOLO generations, performance improved from mAP@50–95 = 0.817 (YOLOv7) to 0.905 (YOLOv12x). Fifty‐two clinicians completed 5,200 image evaluations; the model significantly outperformed all clinician subgroups (one‐way ANOVA with Tukey HSD, p < 0.001). Transformer‐based DF‐DETR achieved mAP@50–95 = 0.878, confirming YOLOv12x's superior efficiency–accuracy trade‐off.

A high‐performing model identified implant brands on periapical radiographs and outperformed clinicians across experience levels. Comparative analysis across YOLO architectures validated its measurable advantage in accuracy and speed. Lack of external validation and dataset imbalance are important limitations; future work will include external, multisite data and human–AI workflow evaluation.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12791189/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12791189/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/PMC12791189/full.md

---
Source: https://tomesphere.com/paper/PMC12791189