Adaptive Logging for Distributed In-memory Databases

Chang Yao; Divyakant Agrawal; Gang Chen; Beng Chin Ooi; Sai Wu

arXiv:1503.03653·cs.DB·April 28, 2015·5 cites

Open Access

TL;DR

This paper introduces an adaptive logging method for distributed in-memory databases that balances transaction processing speed and recovery efficiency by combining data and command logging techniques.

Contribution

It extends command logging to distributed systems and proposes an adaptive approach that optimizes logging strategies based on application needs.

Findings

01

Adaptive logging achieves 10x faster recovery.

02

Transaction throughput is comparable to command logging.

03

Outperforms traditional data logging in distributed environments.

Abstract

A new type of log, the command log, is being employed to replace the traditional data log (e.g., the ARIES log) in in-memory databases. Instead of recording how the tuples are updated, a command log only tracks the transactions being executed, thereby effectively reducing the size of the log and improving performance. Command logging, on the other hand, increases the cost of recovery, because all the transactions in the log after the last checkpoint must be completely redone in case of a failure. In this paper, we first extend the command logging technique to a distributed environment, where all the nodes can perform recovery in parallel. We then propose an adaptive logging approach by combining data logging and command logging. The percentage of data logging versus command logging becomes an optimization between the performance of transaction processing and recovery to suit…
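The trade-off described in the abstract can be illustrated with a toy, single-node sketch. It is not the paper's implementation: the `transfer` procedure, the dictionary-backed store, and all function names are hypothetical, chosen only to show that a command log records one entry per transaction and must re-execute it on recovery, while a data log records one entry per changed tuple and only reinstalls values.

```python
# Toy illustration only; names and structures are assumptions, not from the paper.
from typing import Callable, Dict, List, Tuple

Store = Dict[str, int]


def transfer(store: Store, src: str, dst: str, amount: int) -> None:
    """Hypothetical stored procedure: move `amount` from src to dst."""
    store[src] -= amount
    store[dst] += amount


PROCEDURES: Dict[str, Callable[..., None]] = {"transfer": transfer}


def run_with_command_log(store: Store, log: List[Tuple[str, tuple]],
                         name: str, *args) -> None:
    # Command logging: record only the procedure name and its arguments.
    log.append((name, args))
    PROCEDURES[name](store, *args)


def run_with_data_log(store: Store, log: List[Tuple[str, int]],
                      name: str, *args) -> None:
    # Data logging: record the new value of every tuple the transaction changed.
    before = dict(store)
    PROCEDURES[name](store, *args)
    for key, value in store.items():
        if before.get(key) != value:
            log.append((key, value))


def recover_from_command_log(checkpoint: Store,
                             log: List[Tuple[str, tuple]]) -> Store:
    # Recovery re-executes every logged transaction after the checkpoint.
    store = dict(checkpoint)
    for name, args in log:
        PROCEDURES[name](store, *args)
    return store


def recover_from_data_log(checkpoint: Store,
                          log: List[Tuple[str, int]]) -> Store:
    # Recovery only reinstalls logged values; no transaction logic is rerun.
    store = dict(checkpoint)
    for key, value in log:
        store[key] = value
    return store


checkpoint: Store = {"a": 100, "b": 50}
cmd_store, cmd_log = dict(checkpoint), []
data_store, data_log = dict(checkpoint), []
for amount in (10, 20):
    run_with_command_log(cmd_store, cmd_log, "transfer", "a", "b", amount)
    run_with_data_log(data_store, data_log, "transfer", "a", "b", amount)
```

In this sketch both recovery paths rebuild the same state from the checkpoint, but the command log holds fewer entries at the price of rerunning each transaction during recovery.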

Taxonomy

Topics: Distributed systems and fault tolerance · Advanced Database Systems and Queries · Distributed and Parallel Computing Systems