# LarynxFormer: a transformer-based framework for processing and segmenting laryngeal images

**Authors:** Rune Mæstad, Abdul Hanan, Haakon Kristian Kvidaland, Hege Clemm, Reza Arghandeh

PMC · DOI: 10.3389/fdgth.2025.1459136 · Frontiers in Digital Health · 2025-07-11

## TL;DR

This paper introduces LarynxFormer, a new AI framework that improves the speed and accuracy of diagnosing laryngeal issues during exercise.

## Contribution

LarynxFormer is a novel transformer-based framework that outperforms existing methods in laryngeal image segmentation.

## Key findings

- LarynxFormer achieves better performance metrics than existing laryngeal segmentation models.
- The framework provides up to 2x faster inference time compared to other methods.
- Transformer-based models outperform convolutional-based models in this task.

## Abstract

Manual diagnostic methods for assessing exercise-induced laryngeal obstruction (EILO) contain human bias and can lead to subjective decisions. Several studies have proposed machine learning methods for segmenting laryngeal structures to automate and make diagnostic outcomes more objective. Four state-of-the-art models for laryngeal image segmentation are implemented, trained, and compared using our pre-processed dataset containing laryngeal images derived from continuous laryngoscopy exercise-test (CLE-test) data. These models include both convolutional-based and transformer-based methods. We propose a new framework called LarynxFormer, consisting of a pre-processing pipeline, transformer-based segmentation, and post-processing of laryngeal images. This study contributes to the investigation of using machine learning as a diagnostic tool for EILO. Furthermore, we show that a transformer-based approach for larynx segmentation outperforms conventional state-of-the-art image segmentation methods in terms of performance metrics and computational speed, demonstrating up to 2x faster inference time compared to the other methods.

## Full-text entities

- **Diseases:** induced laryngeal obstruction (MESH:D007827), EILO (MESH:D000092202)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12289634/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12289634/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/PMC12289634/full.md

---
Source: https://tomesphere.com/paper/PMC12289634