Energy cost and machine learning accuracy impact of k-anonymisation and synthetic data techniques
Pepijn de Reus, Ana Oprescu, Koen van Elsen

TL;DR
This study evaluates how k-anonymisation and synthetic data techniques affect the energy consumption and accuracy of machine learning models, finding that k-anonymisation reduces energy use without sacrificing accuracy, while synthetic data maintains energy use but may lower accuracy.
Contribution
It provides a comparative analysis of energy and accuracy impacts of k-anonymisation and synthetic data techniques on machine learning models.
Findings
K-anonymisation reduces energy consumption with similar accuracy.
Synthetic data maintains energy consumption but can lower accuracy.
Models trained on original data serve as baseline for comparison.
Abstract
To address increasing societal concerns regarding privacy and climate, the EU adopted the General Data Protection Regulation (GDPR) and committed to the Green Deal. Considerable research studied the energy efficiency of software and the accuracy of machine learning models trained on anonymised data sets. Recent work began exploring the impact of privacy-enhancing techniques (PET) on both the energy consumption and accuracy of the machine learning models, focusing on k-anonymity. As synthetic data is becoming an increasingly popular PET, this paper analyses the energy consumption and accuracy of two phases: a) applying privacy-enhancing techniques to the concerned data set, b) training the models on the concerned privacy-enhanced data set. We use two privacy-enhancing techniques: k-anonymisation (using generalisation and suppression) and synthetic data, and three machine-learning models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Ethics and Social Impacts of AI
