Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report

Maolin Wang; Beining Bao; Gan Yuan; Hongyu Chen; Bingkun Zhao; Baoshuo Kan; Jiming Xu; Qi Shi; Yinggong Zhao; Yao Wang; Wei Ying Ma; and Jun Yan

arXiv:2603.22954·cs.CR·March 25, 2026

Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report

Maolin Wang, Beining Bao, Gan Yuan, Hongyu Chen, Bingkun Zhao, Baoshuo Kan, Jiming Xu, Qi Shi, Yinggong Zhao, Yao Wang, Wei Ying Ma, and Jun Yan

PDF

Open Access

TL;DR

This paper introduces a novel data transformation framework for privacy-preserving sharing of electronic health records, maintaining data utility while protecting patient privacy through geometric operators and collaborative human-AI design.

Contribution

It presents a new approach using geometric operators for data transformation that preserves clinical data utility and ensures privacy, supported by theoretical and empirical analysis.

Findings

01

Transforms maintain medical semantics and statistical properties.

02

Provably breaks linkage between data views and patient attributes.

03

Effective against reconstruction, linkage, and inference attacks.

Abstract

Electronic health records (EHRs) and other real-world clinical data are essential for clinical research, medical artificial intelligence, and life science, but their sharing is severely limited by privacy, governance, and interoperability constraints. These barriers create persistent data silos that hinder multi-center studies, large-scale model development, and broader biomedical discovery. Existing privacy-preserving approaches, including multi-party computation and related cryptographic techniques, provide strong protection but often introduce substantial computational overhead, reducing the efficiency of large-scale machine learning and foundation-model training. In addition, many such methods make data usable for restricted computation while leaving them effectively invisible to clinicians and researchers, limiting their value in workflows that still require direct inspection,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Machine Learning in Healthcare · Adversarial Robustness in Machine Learning