# Evaluating masked self-supervised learning frameworks for 3D dental model segmentation tasks

**Authors:** Lucas Krenmayr, Reinhold von Schwerin, Daniel Schaudt, Pascal Riedel, Alexander Hafner, Marc Geserick

PMC · DOI: 10.1038/s41598-025-01014-1 · Scientific Reports · 2025-05-14

## TL;DR

This paper explores how masked self-supervised learning can improve 3D dental model segmentation when labeled data is limited.

## Contribution

The study evaluates four masked self-supervised learning frameworks for 3D dental segmentation tasks using pre-training on unlabeled dental models.

## Key findings

- Pre-training improves segmentation performance when labeled data is scarce or imbalanced.
- The benefits of pre-training diminish as more labeled data becomes available.
- Point-based frameworks like Point-BERT and Point-MAE show promise for dental applications.

## Abstract

The application of deep learning using dental models is crucial for automated computer-aided treatment planning. However, developing highly accurate models requires a substantial amount of accurately labeled data. Obtaining this data is challenging, especially in the medical domain. Masked self-supervised learning has shown great promise in overcoming the challenge of data scarcity. However, its effectiveness has not been well explored in the 3D domain, particularly on dental models. In this work, we investigate the applicability of the four recently published masked self-supervised learning frameworks-Point-BERT, Point-MAE, Point-GPT, and Point-M2AE-for improving downstream tasks such as tooth and brace segmentation. These frameworks were pre-trained on a proprietary dataset of over 4000 unlabeled 3D dental models and fine-tuned using the publicly available Teeth3DS dataset for tooth segmentation and a self-constructed braces segmentation dataset. Through a set of experiments we demonstrate that pre-training can enhance the performance of downstream tasks, especially when training data is scarce or imbalanced—a critical factor for clinical usability. Our results show that the benefits are most noticeable when training data is limited but diminish as more labeled data becomes available, providing insights into when and how this technique should be applied to maximize its effectiveness.

## Full-text entities

- **Diseases:** tooth misalignments (MESH:D017760)
- **Chemicals:** dVAE (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12078790/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12078790/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/PMC12078790/full.md

---
Source: https://tomesphere.com/paper/PMC12078790