The Price of Tailoring the Index to Your Data: Poisoning Attacks on Learned Index Structures
Evgenios M. Kornaropoulos, Silei Ren, Roberto Tamassia

TL;DR
This paper investigates poisoning attacks on learned index structures, revealing vulnerabilities where malicious data injections significantly increase model errors, especially in recursive models like RMI.
Contribution
It introduces the first poisoning attack methods on learned index structures, including linear regression and recursive models, demonstrating their security vulnerabilities.
Findings
Attacks can increase RMI error up to 300 times.
Second-stage model errors can increase up to 3000 times.
Poisoning impacts are significant on real-world and synthetic datasets.
Abstract
The concept of learned index structures relies on the idea that the input-output functionality of a database index can be viewed as a prediction task and, thus, be implemented using a machine learning model instead of traditional algorithmic techniques. This novel angle for a decades-old problem has inspired numerous exciting results in the intersection of machine learning and data structures. However, the main advantage of learned index structures, i.e., the ability to adjust to the data at hand via the underlying ML-model, can become a disadvantage from a security perspective as it could be exploited. In this work, we present the first study of poisoning attacks on learned index structures. The required poisoning approach is different from all previous works since the model under attack is trained on a cumulative distribution function (CDF) and, thus, every injection on the training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Data Quality and Management · Pharmacovigilance and Adverse Drug Reactions
MethodsLinear Regression
