# Quantifying the effects of data augmentation and stain color   normalization in convolutional neural networks for computational pathology

**Authors:** David Tellez, Geert Litjens, Peter Bandi, Wouter Bulten, John-Melle, Bokhorst, Francesco Ciompi, Jeroen van der Laak

arXiv: 1902.06543 · 2020-04-16

## TL;DR

This study systematically compares stain color augmentation and normalization techniques in CNNs for pathology, demonstrating their impact on classification across diverse labs and proposing a new normalization method.

## Contribution

It provides the first comprehensive comparison of stain normalization and augmentation techniques in computational pathology and introduces a novel unsupervised stain normalization method.

## Key findings

- Stain augmentation improves CNN robustness across labs.
- Normalization reduces color distribution discrepancies.
- Proposed method outperforms existing normalization techniques.

## Abstract

Stain variation is a phenomenon observed when distinct pathology laboratories stain tissue slides that exhibit similar but not identical color appearance. Due to this color shift between laboratories, convolutional neural networks (CNNs) trained with images from one lab often underperform on unseen images from the other lab. Several techniques have been proposed to reduce the generalization error, mainly grouped into two categories: stain color augmentation and stain color normalization. The former simulates a wide variety of realistic stain variations during training, producing stain-invariant CNNs. The latter aims to match training and test color distributions in order to reduce stain variation. For the first time, we compared some of these techniques and quantified their effect on CNN classification performance using a heterogeneous dataset of hematoxylin and eosin histopathology images from 4 organs and 9 pathology laboratories. Additionally, we propose a novel unsupervised method to perform stain color normalization using a neural network. Based on our experimental results, we provide practical guidelines on how to use stain color augmentation and stain color normalization in future computational pathology applications.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.06543/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1902.06543/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/1902.06543/full.md

---
Source: https://tomesphere.com/paper/1902.06543