# Detection of Acromegaly From Facial Images Using Machine Learning: A Comparison With Clinical Experts

**Authors:** Konstantina Vouzouneraki, Erik Ylipää, Tommy Olsson, Katarina Berinder, Charlotte Höybye, Maria Petersson, Sophie Bensing, Anna-Karin Åkerman, Henrik Borg, Bertil Ekman, Jonas Robért, Britt Edén Engström, Oskar Ragnarsson, Pia Burman, Per Dahlqvist

PMC · DOI: 10.1210/jendso/bvaf203 · Journal of the Endocrine Society · 2025-12-10

## TL;DR

A machine learning model using facial images can detect acromegaly with accuracy similar to expert doctors, suggesting it could be a useful screening tool.

## Contribution

A facial-image-based machine learning model (FaRL) achieves diagnostic accuracy comparable to human experts for acromegaly detection.

## Key findings

- The FaRL model matched human experts in diagnostic accuracy (AUC 0.89) with higher sensitivity.
- Machine learning and experts both showed greater sensitivity in detecting acromegaly in male patients.
- Classification agreement between the best model and experts was 86% for true negatives and 60% for true positives.

## Abstract

Substantial diagnostic delay in acromegaly contributes to increased morbidity and mortality. Screening attempts in high-risk groups have yielded few positive cases, underscoring the need for simple and precise prescreening methods.

Machine-learning analysis of facial images shows promise for acromegaly detection but requires validation in larger, well-characterized cohorts using robust machine-learning frameworks as performed in this study.

Facial images from different angles were collected via smartphone from 155 acromegaly patients (79% biochemically controlled) and 153 matched controls at all Swedish university hospitals. Six machine-learning models were trained to distinguish acromegaly from control images, including 3 deep neural networks pretrained on diverse image datasets (ImageNet models: ResNet50, InceptionV2, and DenseNet121) and 1 network pretrained specifically on facial images (FaRL). Model performance was compared to assessment by 12 experienced endocrinologists.

The diagnostic accuracy of the FaRL-based model was superior to all ImageNet models and matched the accuracy of human experts (area under the receiver operating characteristic curve 0.89 for both) with similar specificity (0.87 vs 0.93) but higher sensitivity (0.82 vs 0.66). Classification agreement between the best machine-learning model (FaRL) and human experts was 86% for true negatives and 60% for true positives. Machine-learning models and human experts both showed greater sensitivity in identifying acromegaly in male patients.

A deep learning model pretrained on facial features (FaRL) can detect acromegaly from standard photographs with accuracy comparable to that of expert endocrinologists. This supports the feasibility of face analysis as a screening tool for acromegaly.

## Linked entities

- **Diseases:** acromegaly (MONDO:0019933)

## Full-text entities

- **Diseases:** Acromegaly (MESH:D000172)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12838525/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12838525/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/PMC12838525/full.md

---
Source: https://tomesphere.com/paper/PMC12838525