# Pixel-level annotations of context region for trustworthy diagnosis of Breast Lymph Node Metastasis from Histopathological WSI

**Authors:** Richa Malviya Dutta, Arif Ahmed Sekh, Prarthana Raghuram, Krishna Kiran, Shirin Dasgupta, Subhamoy Mandal, Debi Prosad Dogra, Pranab K. Dan

PMC · DOI: 10.1016/j.dib.2025.112408 · 2025-12-22

## TL;DR

This paper introduces a dataset of breast lymph node metastasis images with precise pixel-level annotations by experts to improve trustworthy automated cancer diagnosis.

## Contribution

The paper presents a rare, expert-verified dataset with pixel-level annotations for context regions in breast lymph node metastasis WSI.

## Key findings

- A two-stage annotation protocol was used to create high-quality pixel-level labels for 73 Camelyon16 training images.
- Annotations include diagnostically significant regions, background tissue, and artefacts, aiding in reliable patch preparation for metastasis detection.
- The dataset supports the development of trustworthy automated diagnosis systems by providing critical context regions for cancer detection.

## Abstract

There is a dearth of annotated images in digital pathology, and annotations are pivotal for supervised automated diagnosis. This work aims to create a set of data on breast lymph node metastatic whole slide images (WSI) that is truly valuable as it is annotated by the domain experts in a very precise and intensive manner. The annotations provided here are of rare kind that has pixel-level labels divided into three categories; first is based on the diagnostically significant tissue regions, second is background tissue and the third is artefacts present in breast lymph node metastatic WSI. High level annotations, as is provided in this work, for areas apart from the metastatic region is crucial for diagnosis as this is the context region which is critical for cancer diagnosis. Breast lymph node metastasis is a severe medical condition that requires significant efforts by pathologists to examine cancer under microscope using glass slides. Automated cancer diagnosis using WSI, the digitized versions of glass slides, needs to be trustworthy and responsible, which is ensured by involving domain experts in the process that begins with identifying diagnostically significant regions on a WSI. In this dataset, a two-stage high-value pixel-level annotation protocol is designed where selected 73 training images of Camelyon16 dataset are annotated by doctors and verified by pathologists for their accuracy. These annotations would help researchers to reliably prepare patches for further processing that can be used as training samples for extracting more such significant regions for data preparation from WSI for diagnosing metastasis. These reliable and trustworthy annotations would help to take it to the clinic from the research lab much quickly.

## Full-text entities

- **Diseases:** cancer (MESH:D009369), node (MESH:D012804), metastasis (MESH:D009362), Breast Lymph Node Metastasis (MESH:D061325)

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12813485/full.md

---
Source: https://tomesphere.com/paper/PMC12813485