Balancing Privacy and Progress in Artificial Intelligence: Anonymization in Histopathology for Biomedical Research and Education
Neel Kanwal, Emiel A.M. Janssen, Kjersti Engan

TL;DR
This paper examines the legal and technical challenges of sharing histopathology data for AI research, proposing guidelines to balance patient privacy with scientific progress.
Contribution
It reviews current regulations and approaches, highlighting challenges and offering a data-sharing guideline for histological data to support research and education.
Findings
Legal regulations often overlook linkage attack risks.
Standardization issues hinder universal anonymization solutions.
A proposed guideline aims to facilitate safe data sharing.
Abstract
The advancement of biomedical research heavily relies on access to large amounts of medical data. In the case of histopathology, Whole Slide Images (WSI) and clinicopathological information are valuable for developing Artificial Intelligence (AI) algorithms for Digital Pathology (DP). Transferring medical data "as open as possible" enhances the usability of the data for secondary purposes but poses a risk to patient privacy. At the same time, existing regulations push towards keeping medical data "as closed as necessary" to avoid re-identification risks. Generally, these legal regulations require the removal of sensitive data but do not consider the possibility of data linkage attacks due to modern image-matching algorithms. In addition, the lack of standardization in DP makes it harder to establish a single solution for all formats of WSIs. These challenges raise problems for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Artificial Intelligence in Healthcare and Education · Privacy-Preserving Technologies in Data
