StainStyleSampler: Clustering-based sampling of whole slide image appearances

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

StainStyleSampler: Clustering-based sampling of whole slide image appearances

Authors

Silva, M. M. B.; Leh, S.; Weishaupt, H.

Abstract

The appearance of whole slide biopsy images is greatly affected by various factors such as laboratory procedures or the choice of digital slide scanners. The resulting variations in image styles within and across batches of histological images represent one of the major obstacles to the development of generalizable machine learning algorithms. To overcome this challenge, a lot of research has focused on stain normalization and stain augmentation techniques. While such approaches provide effective strategies to reduce stain variation or increase stain invariance, respectively, they typically involve only limited modeling or sampling of the underlying stain style distribution. Tools for a streamlined sampling of different aspects of such a distribution, which would be crucial e.g. for explicitly evaluating machine learning robustness across or with respect to major stain styles, remain largely missing. Here, we present the StainStyleSampler, a toolkit for (i) the exploration and modeling of stain style variations, and (ii) the automated sampling of images or styles capturing the core components of this variation. The tool enables the extraction of various color features and deconvolved stain components, visualization of such features directly or after dimensionality reduction, modeling of style distributions using binning, clustering, and density mapping, and automated sampling of the most representative reference images. We believe that this software will equip pathologists and computer-scientists with a more versatile set of tools that can aid substantially in both the exploration and sampling of stain variation across whole slide images.

Follow Us on

0 comments

Add comment