Supporting Regularized Logistic Regression Privately and Efficiently
Wenfa Li, Hongzhe Liu, Peng Yang, Wei Xie

TL;DR
This paper presents a practical, privacy-preserving method for regularized logistic regression suitable for multi-institutional collaborative studies, ensuring data security while maintaining efficiency and scalability.
Contribution
It introduces a novel, computationally efficient approach leveraging distributed computing and cryptography to protect individual and summary data in logistic regression.
Findings
Validated privacy guarantees through extensive empirical evaluation
Demonstrated efficiency and scalability in large-scale studies
Applicable across disciplines like genetics, biomedical research, and network analysis
Abstract
As one of the most popular statistical and machine learning models, logistic regression with regularization has found wide adoption in biomedicine, social sciences, information technology, and so on. These domains often involve data of human subjects that are contingent upon strict privacy regulations. Increasing concerns over data privacy make it more and more difficult to coordinate and conduct large-scale collaborative studies, which typically rely on cross-institution data sharing and joint analysis. Our work here focuses on safeguarding regularized logistic regression, a widely-used machine learning model in various disciplines while at the same time has not been investigated from a data security and privacy perspective. We consider a common use scenario of multi-institution collaborative studies, such as in the form of research consortia or networks as widely seen in genetics,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
