Open Data Sharing in Clinical Research and Participants Privacy: Challenges and Opportunities in the Era of Artificial Intelligence
Shahin Hallaj, Anna Heinke, Fritz Gerald P. Kalaw, Nayoon Gim, Marian Blazes, Julia Owen, Eamon Dysinger, Erik S. Benton, Benjamin A. Cordier, Nicholas G. Evans, Jennifer Li-Pook-Than, Michael P. Snyder, Camille Nebeker, Linda M. Zangwill, Sally L. Baxter, Shannon McWeeney

TL;DR
This paper discusses the challenges AI poses to maintaining participant privacy in clinical data sharing and proposes new approaches to balance openness with confidentiality, exemplified by a novel data sharing model in a diabetes research project.
Contribution
It introduces the concept of pseudo-reidentification and presents a new open data sharing framework tailored for AI-era clinical research.
Findings
AI increases reidentification risks in shared clinical data
Proposed a novel open data sharing approach for AI-ready research
Demonstrated the approach in a diabetes insights project
Abstract
Sharing clinical research data is key for increasing the pace of medical discoveries that improve human health. However, concern about study participants' privacy, confidentiality, and safety is a major factor that deters researchers from openly sharing clinical data, even after deidentification. This concern is further heightened by the evolution of artificial intelligence (AI) approaches that pose an ever-increasing threat to the reidentification of study participants. Here, we discuss the challenges AI approaches create that blur the lines between identifiable and non-identifiable data. We present a concept of pseudo-reidentification, and discuss how these challenges provide opportunities for rethinking open data sharing practices in clinical research. We highlight the novel open data sharing approach we have established as part of the Artificial Intelligence Ready and Exploratory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
