Privacy-Preserving Linkage of Distributed Datasets using the Personal Health Train
Maximilian Jugl, Sascha Welten, Yongli Mou, Yeliz Ucer Yediel, Oya Deniz Beyan, Ulrich Sax, Toralf Kirsten

TL;DR
This paper introduces a privacy-preserving method for linking distributed datasets in healthcare using the Personal Health Train concept, enabling secure analysis without exposing sensitive data.
Contribution
It presents a novel approach for record linkage across distributed datasets leveraging the Personal Health Train framework, enhancing data quality while maintaining privacy.
Findings
Effective linkage of real-world datasets demonstrated
Preserves data privacy during analysis
Applicable to distributed healthcare data analysis
Abstract
With the generation of personal and medical data at several locations, medical data science faces unique challenges when working on distributed datasets. Growing data protection requirements in recent years drastically limit the use of personally identifiable information. Distributed data analysis aims to provide solutions for securely working on highly sensitive data while minimizing the risk of information leaks, which would not be possible to the same degree in a centralized approach. A novel concept in this field is the Personal Health Train (PHT), which encapsulates the idea of bringing the analysis to the data, not vice versa. Data sources are represented as train stations. Trains containing analysis tasks move between stations and aggregate results. Train executions are coordinated by a central station which data analysts can interact with. Data remains at their respective…
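The station-and-train pattern described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation or the API of any PHT platform: the `Station`, `Train`, and `run_route` names, the age values, and the mean-age task are all hypothetical, chosen only to show an analysis visiting each data source in turn while carrying aggregates rather than records.

```python
from dataclasses import dataclass, field


@dataclass
class Station:
    """Hypothetical data source; its records never leave this object."""
    name: str
    ages: list[int]  # illustrative local data, not from the paper

    def execute(self, train: "Train") -> None:
        # The analysis runs locally; only aggregates are written to the train.
        train.count += len(self.ages)
        train.total += sum(self.ages)
        train.visited.append(self.name)


@dataclass
class Train:
    """Hypothetical analysis task that carries aggregate results only."""
    count: int = 0
    total: int = 0
    visited: list[str] = field(default_factory=list)

    def mean(self) -> float:
        return self.total / self.count


def run_route(train: Train, route: list[Station]) -> Train:
    # Stand-in for the central station's coordination: visit stations in order.
    for station in route:
        station.execute(train)
    return train


stations = [
    Station("A", [34, 51, 47]),
    Station("B", [29, 62]),
]
result = run_route(Train(), stations)
print(result.visited, result.count, result.mean())
```

The analyst only ever sees the state carried by the train (a count and a sum), which is the sense in which the analysis is brought to the data. A real deployment would add authentication, containerized execution, and protection of the intermediate results, none of which is modeled here.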
Taxonomy
Topics: Data Quality and Management · Privacy-Preserving Technologies in Data · Distributed systems and fault tolerance
