Long Oral
in
Workshop: Trustworthy Machine Learning for Healthcare
Do Tissue Source Sites leave identifiable Signatures in Whole Slide Images beyond staining?
Muhammad Dawood · Piotr Keller · Fayyaz ul Amir Minhas
Why can deep learning predictors trained on Whole Slide Images fail to generalize? It is a common theme in Computational Pathology to see a high performing model developed in a research setting experience a large drop in performance when it is deployed to a new clinical environment. One of the major reasons for this is the batch effect that is introduced during the creation of Whole Slide Images resulting in a domain shift. Computational Pathology pipelines try to reduce this effect via stain normalization techniques. However, in this paper, we provide empirical evidence that stain normalization methods do not result in any significant reduction of the batch effect. This is done via clustering analysis of the dataset as well as training weakly-supervised models to predict source sites. This study aims to open up avenues for further research for effective handling of batch effects for improving trustworthiness and generalization of predictive modelling in the Computational Pathology domain.