Data Alchemy: Mitigating Cross-Site Model Variability Through Test Time Data Calibration
Abhijeet Parida, Antonia Alomar, Zhifan Jiang, Pooneh Roshanitabrizi,, Austin Tapp, Maria Ledesma-Carbayo, Ziyue Xu, Syed Muhammed Anwar, Marius, George Linguraru, Holger R. Roth

TL;DR
Data Alchemy introduces an explainable stain normalization and test time data calibration method that effectively reduces cross-site variability in histopathology imaging, enhancing tumor classification performance without altering model weights.
Contribution
The paper presents a novel explainable stain normalization combined with test time calibration that handles unseen site shifts, improving multi-site tumor classification accuracy.
Findings
Boosts AUPR by 0.165 with normalization
Further improves AUPR to 0.852 after calibration
Reduces inter-site domain gap significantly
Abstract
Deploying deep learning-based imaging tools across various clinical sites poses significant challenges due to inherent domain shifts and regulatory hurdles associated with site-specific fine-tuning. For histopathology, stain normalization techniques can mitigate discrepancies, but they often fall short of eliminating inter-site variations. Therefore, we present Data Alchemy, an explainable stain normalization method combined with test time data calibration via a template learning framework to overcome barriers in cross-site analysis. Data Alchemy handles shifts inherent to multi-site data and minimizes them without needing to change the weights of the normalization or classifier networks. Our approach extends to unseen sites in various clinical settings where data domain discrepancies are unknown. Extensive experiments highlight the efficacy of our framework in tumor classification in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Software System Performance and Reliability · Anomaly Detection Techniques and Applications
