REAL-Colon: A dataset for developing real-world AI applications in colonoscopy
Carlo Biffi, Giulio Antonelli, Sebastian Bernhofer, Cesare Hassan,, Daizen Hirata, Mineo Iwatate, Andreas Maieron, Pietro Salvagnini, Andrea, Cherubini

TL;DR
The paper introduces REAL-Colon, a large, high-quality dataset of colonoscopy videos with annotations and clinical data, aimed at advancing AI applications in real-world colonoscopy procedures.
Contribution
It provides the first extensive, multi-center, real-world colonoscopy video dataset with detailed annotations and clinical information for AI research.
Findings
Dataset contains 2.7 million frames from 60 colonoscopies.
Includes 350,000 expert-annotated bounding boxes.
Enables benchmarking and development of AI models for colonoscopy.
Abstract
Detection and diagnosis of colon polyps are key to preventing colorectal cancer. Recent evidence suggests that AI-based computer-aided detection (CADe) and computer-aided diagnosis (CADx) systems can enhance endoscopists' performance and boost colonoscopy effectiveness. However, most available public datasets primarily consist of still images or video clips, often at a down-sampled resolution, and do not accurately represent real-world colonoscopy procedures. We introduce the REAL-Colon (Real-world multi-center Endoscopy Annotated video Library) dataset: a compilation of 2.7M native video frames from sixty full-resolution, real-world colonoscopy recordings across multiple centers. The dataset contains 350k bounding-box annotations, each created under the supervision of expert gastroenterologists. Comprehensive patient clinical data, colonoscopy acquisition information, and polyp…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection
