A comprehensive multimodal dataset and benchmark for ulcerative colitis scoring in endoscopy
Noha Ghatwary, Jiangbei Yue, Ahmed Elgendy, Hanna Nagdy, Ahmed Galal, Hayam Fathy, Hussein El-Amin, Venkataraman Subramanian, Noor Mohammed, Gilberto Ochoa-Ruiz, and Sharib Ali

TL;DR
This paper introduces a large, expert-annotated multimodal dataset for ulcerative colitis endoscopy scoring, enabling improved automated assessment and captioning of mucosal images, and provides benchmark results for various models.
Contribution
It presents the first comprehensive multimodal dataset with dual scoring and expert captions for UC, facilitating research in automated scoring and image captioning.
Findings
Benchmarking with CNNs, transformers, and multimodal models achieved promising results.
The dataset improves robustness and generalisability of UC assessment algorithms.
Expert captions provide clinically meaningful descriptions of mucosal appearance.
Abstract
Ulcerative colitis (UC) is a chronic mucosal inflammatory condition that places patients at increased risk of colorectal cancer. Colonoscopic surveillance remains the gold standard for assessing disease activity, and reporting typically relies on standardised endoscopic scoring metrics. The most widely used is the Mayo Endoscopic Score (MES), with some centres also adopting the Ulcerative Colitis Endoscopic Index of Severity (UCEIS). Both are descriptive assessments of mucosal inflammation (MES: 0 to 3; UCEIS: 0 to 8), where higher values indicate more severe disease. However, computational methods for automatically predicting these scores remain limited, largely due to the lack of publicly available expert-annotated datasets and the absence of robust benchmarking. There is also a significant research gap in generating clinically meaningful descriptions of UC images, despite image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInflammatory Bowel Disease · Colorectal Cancer Screening and Detection · Multimodal Machine Learning Applications
