Correlated Differential Privacy: Feature Selection in Machine Learning
Tao Zhang, Tianqing Zhu, Ping Xiong, Huan Huo, Zahir Tari, Wanlei Zhou

TL;DR
This paper introduces a differentially private feature selection method that reduces data correlation to improve privacy preservation and prediction accuracy in industrial machine learning applications.
Contribution
It proposes a novel correlation reduction scheme for differentially private feature selection that accounts for data correlation, enhancing privacy and utility in industrial settings.
Findings
Better prediction results in machine learning tasks.
Fewer mean square errors for data queries.
Effective management of data correlation for privacy preservation.
Abstract
Privacy preserving in machine learning is a crucial issue in industry informatics since data used for training in industries usually contain sensitive information. Existing differentially private machine learning algorithms have not considered the impact of data correlation, which may lead to more privacy leakage than expected in industrial applications. For example, data collected for traffic monitoring may contain some correlated records due to temporal correlation or user correlation. To fill this gap, we propose a correlation reduction scheme with differentially private feature selection considering the issue of privacy loss when data have correlation in machine learning tasks. %The key to the proposed scheme is to describe the data correlation and select features which leads to less data correlation across the whole dataset. The proposed scheme involves five steps with the goal of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
