Privacy enhanced collaborative inference in the Cox proportional hazards model for distributed data
Mengtong Hu, Xu Shi, Peter X.-K. Song

TL;DR
This paper introduces a privacy-preserving collaborative Cox proportional hazards model for distributed data that avoids centralized data sharing and risk set construction, maintaining statistical power while enhancing data privacy.
Contribution
The paper proposes a novel distributed Cox model that shares only summary statistics, eliminating the need for centralized data and risk set sharing, thus improving privacy in multicenter studies.
Findings
The method retains similar statistical power to centralized analysis.
Simulation studies demonstrate the effectiveness of the approach.
Application to kidney transplant data illustrates practical utility.
Abstract
Data sharing barriers are paramount challenges arising from multicenter clinical studies where multiple data sources are stored in a distributed fashion at different local study sites. Particularly in the case of time-to-event analysis when global risk sets are needed for the Cox proportional hazards model, access to a centralized database is typically necessary. Merging such data sources into a common data storage for a centralized statistical analysis requires a data use agreement, which is often time-consuming. Furthermore, the construction and distribution of risk sets to participating clinical centers for subsequent calculations may pose a risk of revealing individual-level information. We propose a new collaborative Cox model that eliminates the need for accessing the centralized database and constructing global risk sets but needs only the sharing of summary statistics with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Statistical Methods and Inference · Bayesian Modeling and Causal Inference
