Privacy-preserving feature selection: A survey and proposing a new set of protocols
Javad Rahimipour Anaraki, Saeed Samet

TL;DR
This paper surveys privacy-preserving feature selection methods, identifies gaps, and proposes a new rough set-based protocol capable of handling distributed datasets while safeguarding user privacy.
Contribution
It introduces a novel privacy-preserving feature selection method suitable for distributed data, enhancing privacy and applicability over existing approaches.
Findings
Reviewed three existing privacy-preserving feature selection methods
Identified performance gaps in current methods
Proposed a new rough set-based protocol for distributed datasets
Abstract
Feature selection is the process of sieving features, in which informative features are separated from the redundant and irrelevant ones. This process plays an important role in machine learning, data mining and bioinformatics. However, traditional feature selection methods are only capable of processing centralized datasets and are not able to satisfy today's distributed data processing needs. These needs require a new category of data processing algorithms called privacy-preserving feature selection, which protects users' data by not revealing any part of the data neither in the intermediate processing nor in the final results. This is vital for the datasets which contain individuals' data, such as medical datasets. Therefore, it is rational to either modify the existing algorithms or propose new ones to not only introduce the capability of being applied to distributed datasets, but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Mining Algorithms and Applications · Rough Sets and Fuzzy Logic
MethodsFeature Selection
