# Patch Clustering for Representation of Histopathology Images

**Authors:** Wafa Chenni, Habib Herbi, Morteza Babaie, H.R.Tizhoosh

arXiv: 1903.07013 · 2019-03-19

## TL;DR

This paper introduces a patch clustering method using SOM and GMM to efficiently represent large histopathology images, significantly reducing processing time and storage with minimal accuracy loss.

## Contribution

It proposes a novel approach combining SOM and GMM for patch selection in WSI, improving efficiency while maintaining high retrieval accuracy.

## Key findings

- Reduced search time and storage by 50-90%.
- LBP features outperform deep features in clustering.
- Achieved 65% retrieval accuracy with half the patches.

## Abstract

Whole Slide Imaging (WSI) has become an important topic during the last decade. Even though significant progress in both medical image processing and computational resources has been achieved, there are still problems in WSI that need to be solved. A major challenge is the scan size. The dimensions of digitized tissue samples may exceed 100,000 by 100,000 pixels causing memory and efficiency obstacles for real-time processing. The main contribution of this work is representing a WSI by selecting a small number of patches for algorithmic processing (e.g., indexing and search). As a result, we reduced the search time and storage by various factors between ($50\% - 90\%$), while losing only a few percentages in the patch retrieval accuracy. A self-organizing map (SOM) has been applied on local binary patterns (LBP) and deep features of the KimiaPath24 dataset in order to cluster patches that share the same characteristics. We used a Gaussian mixture model (GMM) to represent each class with a rather small ($10\%-50\%$) portion of patches. The results showed that LBP features can outperform deep features. By selecting only $50\%$ of all patches after SOM clustering and GMM patch selection, we received $65\%$ accuracy for retrieval of the best match, while the maximum accuracy (using all patches) was $69\%$.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.07013/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1903.07013/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/1903.07013/full.md

---
Source: https://tomesphere.com/paper/1903.07013