PPaaS: Privacy Preservation as a Service
Pathum Chamikara Mahawaga Arachchige, Peter Bertok, Ibrahim Khalil,, Dongxi Liu, Seyit Camtepe

TL;DR
PPaaS is a flexible framework that dynamically selects optimal privacy-preserving data perturbation methods tailored to specific datasets and applications, balancing privacy and utility in data sharing.
Contribution
Introduces PPaaS, a generic platform that automates the selection of privacy-preserving techniques for data sharing, considering dataset characteristics and application needs.
Findings
Enhances privacy-utility trade-off in data sharing.
Provides a flexible, application-specific privacy preservation platform.
Supports big data sanitization with diverse perturbation methods.
Abstract
Personally identifiable information (PII) can find its way into cyberspace through various channels, and many potential sources can leak such information. Data sharing (e.g. cross-agency data sharing) for machine learning and analytics is one of the important components in data science. However, due to privacy concerns, data should be enforced with strong privacy guarantees before sharing. Different privacy-preserving approaches were developed for privacy preserving data sharing; however, identifying the best privacy-preservation approach for the privacy-preservation of a certain dataset is still a challenge. Different parameters can influence the efficacy of the process, such as the characteristics of the input dataset, the strength of the privacy-preservation approach, and the expected level of utility of the resulting dataset (on the corresponding data mining application such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
