# Compaction-Aware Flash Memory Remapping for Key–Value Stores

**Authors:** Jialin Wang, Zhen Yang, Yi Fan, Yajuan Du

PMC · DOI: 10.3390/mi16060699 · 2025-06-11

## TL;DR

This paper introduces RemapCom, a new design for key-value stores that reduces write amplification and improves performance in flash memory systems.

## Contribution

The novel contribution is a compaction-aware remapping technique that identifies and retains unchanged data blocks to reduce write amplification.

## Key findings

- RemapCom reduces write amplification by up to 53%.
- The design improves write throughput by up to 30% compared to existing methods.
- A prototype implementation on LevelDB demonstrates the effectiveness of the approach.

## Abstract

With the rapid development of big data and artificial intelligence, the demand for memory has exploded. As a key data structure in modern databases and distributed storage systems, the Log-Structured Merge Tree (LSM-tree) has been widely employed (such as LevelDB, RocksDB, etc.) in systems based on key–value pairs due to its efficient writing performance. In LSM-tree-based KV stores, typically deployed on systems with DRAM-SSD storage, the KV items are first organized into MemTable as buffer for SSTables in main memory. When the buffer size exceeds the threshold, MemTable is flushed to the SSD and reorganized into an SSTable, which is then passed down level by level through compaction. However, the compaction degrades write performance and SSD endurance due to significant write amplification. To address this issue, recent proposals have mostly focused on redesigning the structure of LSM trees. We discover the prevalence of unchanged data blocks (UDBs) in the LSM-tree compaction process, i.e., UDBs are written back to SSD the same as they are read into memory, which induces extra write amplification and degrades I/O performance. In this paper, we propose a KV store design in SSD, called RemapCom, to exploit remapping on these UDBs. RemapCom first identifies UDBs with a lightweight state machine integrated into the compaction merge process. In order to increase the ratio of UDBs, RemapCom further designs a UDB retention method to further develop the benefit of remapping. Moreover, we implement a prototype of RemapCom on LevelDB by providing two primitives for the remapping. Compared to the state of the art, the evaluation results demonstrate that RemapCom can reduce write amplification by up to 53% and improve write throughput by up to 30%.

## Full-text entities

- **Chemicals:** UDB (-)

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12195135/full.md

---
Source: https://tomesphere.com/paper/PMC12195135