# Multi-Class Malocclusion Detection on Standardized Intraoral Photographs Using YOLOv11

**Authors:** Ani Nebiaj, Markus Mühling, Bernd Freisleben, Babak Sayahpour

PMC · DOI: 10.3390/dj14010060 · Dentistry Journal · 2026-01-16

## TL;DR

This paper introduces a YOLOv11-based system for automatically detecting multiple types of dental malocclusions in intraoral photos, aiming to improve efficiency and consistency in orthodontic screening.

## Contribution

The novel contribution is a deep learning model using YOLOv11 trained on standardized annotations to detect 15 malocclusion classes in routine intraoral photographs.

## Key findings

- The model achieved 87.8% mAP50, 76.9% macro-P, and 86.1% macro-R across 15 malocclusion classes.
- High performance was observed for Deep bite (98.8%), Diastema (97.9%), and Angle Class II canine (97.5%).
- Lower performance was noted for Angle Class III canine (76%) and Posterior crossbite (75.6%), attributed to limited examples and visualization constraints.

## Abstract

Background/Objectives: Accurate identification of dental malocclusions from routine clinical photographs can be time-consuming and subject to interobserver variability. A YOLOv11-based deep learning approach is presented and evaluated for automatic malocclusion detection on routine intraoral photographs, testing the hypothesis that training on a structured annotation protocol enables reliable detection of multiple clinically relevant malocclusions. Methods: An anonymized dataset of 5854 intraoral photographs (frontal occlusion; right/left buccal; maxillary/mandibular occlusal) was labeled according to standardized instructions derived from the Index of Orthodontic Treatment Need (IOTN) A total of 17 clinically relevant classes were annotated with bounding boxes. Due to an insufficient number of examples, two malocclusions (transposition and non-occlusion) were excluded from our quantitative analysis. A YOLOv11 model was trained with augmented data and evaluated on a held-out test set using mean average precision at IoU 0.5 (mAP50), macro precision (macro-P), and macro recall (macro-R). Results: Across 15 analyzed classes, the model achieved 87.8% mAP50, 76.9% macro-P, and 86.1% macro-R. The highest per-class AP50 was observed for Deep bite (98.8%), Diastema (97.9%), Angle Class II canine (97.5%), Anterior open bite (92.8%), Midline shift (91.8%), Angle Class II molar (91.1%), Spacing (91%), and Crowding (90.1%). Moderate performance included Anterior crossbite (88.3%), Angle Class III molar (87.4%), Head bite (82.7%), and Posterior open bite (80.2%). Lower values were seen for Angle Class III canine (76%), Posterior crossbite (75.6%), and Big overjet (75.3%). Precision–recall trends indicate earlier precision drop-off for posterior/transverse classes and comparatively more missed detections in Posterior crossbite, whereas Big overjet exhibited more false positives at the chosen threshold. Conclusion: A YOLOv11-based deep learning system can accurately detect several clinically salient malocclusions on routine intraoral photographs, supporting efficient screening and standardized documentation. Performance gaps align with limited examples and visualization constraints in posterior regions. Larger, multi-center datasets, protocol standardization, quantitative metrics, and multimodal inputs may further improve robustness.

## Full-text entities

- **Diseases:** transposition (MESH:C536650), Head bite (MESH:D006258), III molar (MESH:D006828), Malocclusion (MESH:D008310), Anterior open bite (MESH:D024343)
- **Species:** Canis lupus familiaris (dog, subspecies) [taxon 9615]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12839579/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12839579/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC12839579/full.md

---
Source: https://tomesphere.com/paper/PMC12839579