# An ultrasound-based artificial intelligence framework for difficult airway prediction: A two-model, three-step decision framework

**Authors:** Chunmeng Fu, Cunyuan Luan, Huabo Liu, Wenfei Wang, Xia Zhou, Yuanfang Jia, Bing Ding, Lei Zhang, Li Yuan, Zejun Niu

PMC · DOI: 10.1371/journal.pone.0342339 · PLOS One · 2026-02-18

## TL;DR

This study introduces an AI framework using ultrasound images to predict difficult airways during anesthesia, aiming to assist clinicians in risk assessment.

## Contribution

A novel two-model, three-step AI framework for difficult airway prediction using ultrasound images is proposed.

## Key findings

- The CL-AI model achieved an AUC of 0.86 in the independent test set for difficult airway prediction.
- The VIDIAC-AI model showed an AUC of 0.82 for secondary risk stratification of difficult laryngoscopic exposure.
- The framework is intended as a decision-support tool and requires further validation in larger studies.

## Abstract

At present, the early warning of difficult airway remains fraught with challenges. Previous ultrasonic quantitative parameters have demonstrated favorable application potential in difficult airway assessment, and deep learning techniques have also exhibited satisfactory performance in the interpretation of this condition. Based on this, we aim to construct a “two-model, three-step” hierarchical strategy, develop an ultrasound image-based artificial intelligence (AI) framework for difficult airway prediction, and conduct its internal validation.

In this study, we included 903 patients who underwent elective general anesthesia surgery at the Affiliated Hospital of Qingdao University between May 2024 and April 2025. 752 cases were used for model training and validation, and 151 cases served as an internal test set. Four planes of neck ultrasound images were scanned for each patient and used to develop two artificial intelligence models (based on convolutional neural networks): CL-AI for initial screening and VIDIAC-AI for secondary risk stratification. Model performance was evaluated using five-fold cross-validation and internal testing. External validation was not performed.

Among 903 patients, difficult laryngoscopy occurred in 189 cases (20.9%) under direct laryngoscopy and in 50 cases (5.5%) under video laryngoscopy. In the independent test set, the CL-AI model achieved an AUC of 0.86 (95% CI: 0.79–0.91), with an accuracy of 0.84, sensitivity of 0.84, specificity of 0.84, precision of 0.59, and an F1 score of 0.69. The VIDIAC-AI model achieved an AUC of 0.82 (95% CI: 0.75–0.88), with an accuracy of 0.81, sensitivity of 0.75, specificity of 0.81, precision of 0.18, and an F1 score of 0.29.

This study proposes an ultrasound-based AI framework for risk stratification of difficult laryngoscopic exposure. The Two-Model, Three-Step decision framework is intended as a clinician decision-support tool, not an independent diagnostic method, and requires further validation in large multicenter cohorts.

## Full-text entities

- **Genes:** MLC1 (modulator of VRAC current 1) [NCBI Gene 23209] {aka LVM, MLC, VL}
- **Diseases:** critically ill (MESH:D016638), head and neck fractures (MESH:D006258), DL (MESH:D051556), AI (MESH:C538142), neurological disorders (MESH:D009461), VIDIAC (MESH:D008310), TMD (MESH:D049310), language communication disorders (MESH:D003147)
- **Chemicals:** midazolam (MESH:D008874), propofol (MESH:D015742), rocuronium (MESH:D000077123), DL (-), sufentanil (MESH:D017409)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12915933/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12915933/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12915933/full.md

---
Source: https://tomesphere.com/paper/PMC12915933