A Cytology Dataset for Early Detection of Oral Squamous Cell Carcinoma
Garima Jain, Sanghamitra Pati, Mona Duggal, Amit Sethi, Abhijeet Patil, Gururaj Malekar, Nilesh Kowe, Jitender Kumar, Jatin Kashyap, Divyajeet Rout, Deepali, Hitesh, Nishi Halduniya, Sharat Kumar, Heena Tabassum, Rupinder Singh Dhaliwal, Sucheta Devi Khuraijam, Sushma Khuraijam

TL;DR
This paper introduces the first large, multicenter oral cytology dataset with expert annotations, aiming to facilitate AI-driven early detection of oral squamous cell carcinoma, especially in resource-limited settings.
Contribution
It provides a comprehensive, labeled dataset of oral cytology slides from multiple centers, addressing the lack of publicly available data for AI model development in OSCC detection.
Findings
Dataset includes annotated slides stained with PAP and MGG protocols.
Collected from ten medical centers across India.
Designed to improve AI-based early diagnosis of OSCC.
Abstract
Oral squamous cell carcinoma OSCC is a major global health burden, particularly in several regions across Asia, Africa, and South America, where it accounts for a significant proportion of cancer cases. Early detection dramatically improves outcomes, with stage I cancers achieving up to 90 percent survival. However, traditional diagnosis based on histopathology has limited accessibility in low-resource settings because it is invasive, resource-intensive, and reliant on expert pathologists. On the other hand, oral cytology of brush biopsy offers a minimally invasive and lower cost alternative, provided that the remaining challenges, inter observer variability and unavailability of expert pathologists can be addressed using artificial intelligence. Development and validation of robust AI solutions requires access to large, labeled, and multi-source datasets to train high capacity models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Head and Neck Cancer Studies · Oral Health Pathology and Treatment
