# GastroGPT Pioneering Specialized AI in Gastroenterology: Strengths, Pitfalls, and the Road to Clinical Integration

**Authors:** Angad Tiwari, Hareesha Rishab Bharadwaj, Khabab Abbasher Hussien Mohamed Ahmed, Dushyant Singh Dahiya

PMC · DOI: 10.1002/jgh3.70306 · JGH Open: An Open Access Journal of Gastroenterology and Hepatology · 2025-11-16

## TL;DR

GastroGPT is a specialized AI for gastroenterology that outperforms general models in clinical tasks, showing promise for improving patient care.

## Contribution

GastroGPT introduces a domain-specific AI model for gastroenterology with improved performance on clinical tasks compared to general-purpose models.

## Key findings

- GastroGPT achieved a mean score of 8.1 ± 1.8 on a 10-point scale across simulated gastroenterology cases.
- It outperformed general models like GPT-4, Bard, and Claude in six out of seven clinical tasks.
- Its performance suggests potential for use in resource-limited settings due to reproducibility and consistency.

## Abstract

GastroGPT, a transformer‐based large language model, has been developed specifically for gastroenterology. It exhibited improved ability in clinical tasks compared to a general‐purpose model such as GPT‐4, Bard and Claude. GastroGPT was developed by Cem Simsek, MD, and was presented at UEG Week 2023. The GastroGPT dataset is adapted on 1.2 million tokens, including peer‐reviewed content from leading gastroenterology journals, clinical guidelines and 10 000 synthetic GI vignettes. In 10 simulated cases of inflammatory bowel disease, cases of endoscopy, and hepatology, GastroGPT achieved a mean score of 8.1 ± 1.8 on a 10‐point Likert scale. GastroGPT achieved higher mean scores compared to comparators (p < 0.001) on six out of seven tasks and included tasks such as patient history acquisition, recommendation for referral, and patient education. Its reproducibility and consistency across task complexities indicate its potential in situations of resource limitation. While it remains limited by its reliance on simulated cases, some participant selection and exposure bias attributed to training data, and lack of appropriate comparisons with medical‐specific models such as OpenEvidence, there remains the need for future real‐world trials and multimodal integrations within workflows to evaluate GastroGPT's transformation potential in improving gastroenterology workflows and patient care.

## Linked entities

- **Diseases:** inflammatory bowel disease (MONDO:0005265)

## Full-text entities

- **Diseases:** inflammatory bowel disease (MESH:D015212)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12620405/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12620405/full.md

## References

11 references — full list in the complete paper: https://tomesphere.com/paper/PMC12620405/full.md

---
Source: https://tomesphere.com/paper/PMC12620405