Lethe: A Tunable Delete-Aware LSM Engine (Updated Version)
Subhadeep Sarkar, Tarikul Islam Papon, Dimitris Staratzis, Manos, Athanassoulis

TL;DR
Lethe is a new LSM-based key-value engine that efficiently supports fast, tunable delete operations on any attribute, improving space efficiency and read throughput while enabling range deletes without full reorganization.
Contribution
It introduces a delete-aware data layout, new compaction policies, and supports user-defined delete latency thresholds, advancing delete support in LSM engines.
Findings
Supports user-defined delete latency thresholds
Achieves 1.17-1.4x higher read throughput
Reduces space amplification by 2.1-9.8x
Abstract
Data-intensive applications fueled the evolution of log structured merge (LSM) based key-value engines that employ the out-of-place paradigm to support high ingestion rates with low read/write interference. These benefits, however, come at the cost of treating deletes as a second-class citizen. A delete inserts a tombstone that invalidates older instances of the deleted key. State-of-the-art LSM engines do not provide guarantees as to how fast a tombstone will propagate to persist the deletion. Further, LSM engines only support deletion on the sort key. To delete on another attribute (e.g., timestamp), the entire tree is read and re-written. We highlight that fast persistent deletion without affecting read performance is key to support: (i) streaming systems operating on a window of data, (ii) privacy with latency guarantees on the right-to-be-forgotten, and (iii) en masse cloud…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Parallel Computing and Optimization Techniques · Caching and Content Delivery
