Benchmarking the Utility of Privacy-Preserving Cox Regression Under Data-Driven Clipping Bounds: A Multi-Dataset Simulation Study
Keita Fukuyama, Yukiko Mori, Tomohiro Kuroda, Hiroaki Kikuchi

TL;DR
This study systematically evaluates how differential privacy mechanisms affect the utility of Cox regression models across multiple datasets, revealing significant utility loss at strict privacy levels.
Contribution
It provides the first comprehensive analysis of DP impacts on Cox models using data-driven clipping bounds and multiple perturbation strategies.
Findings
At strict DP levels ($\\varepsilon \\leq 1$), most covariates lose significance.
Predictive performance degrades to near-random levels under many conditions.
Perturbing only covariates best preserves model utility at moderate privacy levels.
Abstract
Differential privacy (DP) is a mathematical framework that guarantees individual privacy; however, systematic evaluation of its impact on statistical utility in survival analyses remains limited. In this study, we systematically evaluated the impact of DP mechanisms (Laplace mechanism and Randomized Response) with data-driven clipping bounds on the Cox proportional hazards model, using 5 clinical datasets (--), 15 levels of (0.1--1000), and Monte Carlo iterations. The data-driven clipping bounds used here are observed min/max and therefore do not provide formal -DP guarantees; the results represent an optimistic lower bound on utility degradation under formal DP. We compared three types of input perturbations (covariates only, all inputs, and the discrete-time model) with output perturbations (dfbeta-based sensitivity), using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
