FLAC: A Robust Failure-Aware Atomic Commit Protocol for Distributed Transactions
Hexiang Pan, Quang-Trung Ta, Meihui Zhang, Yeow Meng Chee, Gang Chen,, Beng Chin Ooi

TL;DR
FLAC is a failure-aware atomic commit protocol that dynamically adapts to different failure environments using reinforcement learning, significantly improving throughput and latency in distributed transactions.
Contribution
The paper introduces FLAC, a novel ACP that switches protocols based on failure conditions, enhancing efficiency and robustness in distributed transaction processing.
Findings
Up to 2.22x throughput improvement
Up to 2.82x latency speedup
Effective handling of crash and network failures
Abstract
In distributed transaction processing, atomic commit protocol (ACP) is used to ensure database consistency. With the use of commodity compute nodes and networks, failures such as system crashes and network partitioning are common. It is therefore important for ACP to dynamically adapt to the operating condition for efficiency while ensuring the consistency of the database. Existing ACPs often assume stable operating conditions, hence, they are either non-generalizable to different environments or slow in practice. In this paper, we propose a novel and practical ACP, called Failure-Aware Atomic Commit (FLAC). In essence, FLAC includes three protocols, which are specifically designed for three different environments: (i) no failure occurs, (ii) participant nodes might crash but there is no delayed connection, or (iii) both crashed nodes and delayed connection can occur. It models these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Cloud Computing and Resource Management · Cryptography and Data Security
