# ChatGPT-4o with faculty guidance outperforms AI-only and traditional learning in ultrasonography training: a randomized trial

**Authors:** Dao-Rong Hong, Chun-Yan Huang, Jiu Gao

PMC · DOI: 10.3389/fdgth.2026.1772965 · Frontiers in Digital Health · 2026-03-06

## TL;DR

Blending ChatGPT-4o with faculty tutorials improves ultrasonography training for residents more than AI-only or traditional methods.

## Contribution

A blended AI-human approach outperforms AI-only and traditional learning in ultrasonography training.

## Key findings

- Blended group scored highest (128.40) compared to AI-only (119.87) and control (110.60).
- Blended group performed better in obstetrics/gynaecology and superficial organ ultrasonography.
- ChatGPT-4o alone had 47% accuracy on image-based questions, showing limitations in image analysis.

## Abstract

Ultrasonography training for residents is challenging owing to its operator-dependent nature and difficulties in mastering subtle image interpretation. Multimodal large language models like ChatGPT-4o enable efficient knowledge retrieval but show marked limitations in static ultrasonography image analysis.

In this prospective, single-centre randomized controlled trial, 45 first-year ultrasonography residents were randomly allocated to control (traditional resources), AI-only (ChatGPT-4o exclusively), or blended (ChatGPT-4o plus weekly faculty tutorials) groups. After a 3-week intervention, performance was assessed using a 150-item examination (pure-text and image-based multiple-choice questions). The study was approved by the institutional ethics committee, and written informed consent was obtained.

The blended group achieved the highest scores (mean 128.40 ± 18.25) vs. AI-only (119.87 ± 19.11) and control (110.60 ± 20.45; P = 0.02), with superior pure-text performance (P = 0.03) and significant advantages in obstetrics/gynaecology (P = 0.04) and superficial organ ultrasonography (P = 0.047). Examination time was shortest in the blended group (P = 0.03). ChatGPT-4o alone was 85% accurate on text but only 47% on image-based questions.

A faculty-guided AI-integrated strategy was associated with improved short-term post-intervention performance compared with AI-only or traditional learning; however, effects reflect the combined intervention and AI support for static ultrasound image interpretation remains limited.

## Full-text entities

- **Diseases:** breast lesions (MESH:D061325), D-RH (MESH:C564833), MCQs (MESH:C538270), thyroid nodules (MESH:D016606), LLMs (MESH:D007806)
- **Chemicals:** LLM (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12969794/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12969794/full.md

## References

22 references — full list in the complete paper: https://tomesphere.com/paper/PMC12969794/full.md

---
Source: https://tomesphere.com/paper/PMC12969794