PanNuke Dataset Extension, Insights and Baselines
Jevgenij Gamper, Navid Alemi Koohbanani, Ksenija Benes, Simon Graham,, Mostafa Jahanifar, Syed Ali Khurram, Ayesha Azam, Katherine Hewitt, Nasir, Rajpoot

TL;DR
This paper extends the PanNuke dataset for nuclei segmentation and classification in pathology, providing insights into model performance and application to real-world clinical data.
Contribution
It introduces an expanded, high-quality dataset with diverse tissue types and offers analysis of model performance and practical application in computational pathology.
Findings
Models trained on PanNuke perform well on diverse tissue types.
The dataset enables better understanding of nuclei segmentation challenges.
Recommendations for improving deep learning tools in clinical pathology.
Abstract
The emerging area of computational pathology (CPath) is ripe ground for the application of deep learning (DL) methods to healthcare due to the sheer volume of raw pixel data in whole-slide images (WSIs) of cancerous tissue slides. However, it is imperative for the DL algorithms relying on nuclei-level details to be able to cope with data from `the clinical wild', which tends to be quite challenging. We study, and extend recently released PanNuke dataset consisting of ~200,000 nuclei categorized into 5 clinically important classes for the challenging tasks of segmenting and classifying nuclei in WSIs. Previous pan-cancer datasets consisted of only up to 9 different tissues and up to 21,000 unlabeled nuclei and just over 24,000 labeled nuclei with segmentation masks. PanNuke consists of 19 different tissue types that have been semi-automatically annotated and quality controlled by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI
