Machine Unlearning in Learned Databases: An Experimental Analysis
Meghdad Kurmanji, Eleni Triantafillou, Peter Triantafillou

TL;DR
This paper conducts an experimental analysis of machine unlearning in neural network-based learned databases, exploring its effects, metrics, efficiency, and integration with data updates across various tasks and datasets.
Contribution
It provides the first comprehensive experimental evaluation of unlearning algorithms in learned databases, addressing effects on models, downstream tasks, metrics, and efficiency, and proposing a benchmark.
Findings
Unlearning algorithms significantly impact NN-based DB models.
Metrics for assessing unlearning efficacy are identified.
Unlearning overhead varies with batching and algorithm choice.
Abstract
Machine learning models based on neural networks (NNs) are enjoying ever-increasing attention in the DB community. However, an important issue has been largely overlooked, namely the challenge of dealing with the highly dynamic nature of DBs, where data updates are fundamental, highly-frequent operations. Although some recent research has addressed the issues of maintaining updated NN models in the presence of new data insertions, the effects of data deletions (a.k.a., "machine unlearning") remain a blind spot. With this work, for the first time to our knowledge, we pose and answer the following key questions: What is the effect of unlearning algorithms on NN-based DB models? How do these effects translate to effects on downstream DB tasks, such as selectivity estimation (SE), approximate query processing (AQP), data generation (DG), and upstream tasks like data classification (DC)?…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Machine Learning and Algorithms
