Privacy-Preserving Tensor Factorization for Collaborative Health Data Analysis
Jing Ma, Qiuchen Zhang, Jian Lou, Joyce C. Ho, Li Xiong, Xiaoqian, Jiang

TL;DR
DPFact is a novel privacy-preserving tensor factorization method enabling hospitals to collaboratively analyze electronic health records without sharing sensitive data, ensuring interpretability and efficiency under strict privacy constraints.
Contribution
The paper introduces DPFact, a new framework combining differential privacy and collaborative learning for tensor factorization in healthcare data analysis.
Findings
Outperforms baseline methods in accuracy under privacy constraints
Reduces communication overhead compared to existing approaches
Effectively handles heterogeneous patient populations
Abstract
Tensor factorization has been demonstrated as an efficient approach for computational phenotyping, where massive electronic health records (EHRs) are converted to concise and meaningful clinical concepts. While distributing the tensor factorization tasks to local sites can avoid direct data sharing, it still requires the exchange of intermediary results which could reveal sensitive patient information. Therefore, the challenge is how to jointly decompose the tensor under rigorous and principled privacy constraints, while still support the model's interpretability. We propose DPFact, a privacy-preserving collaborative tensor factorization method for computational phenotyping using EHR. It embeds advanced privacy-preserving mechanisms with collaborative learning. Hospitals can keep their EHR database private but also collaboratively learn meaningful clinical concepts by sharing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
