Network Bandwidth Variation-Adapted State Transfer for Geo-Replicated State Machines and its Application to Dynamic Replica Replacement
Tairi Chiba, Ren Ohmura, Junya Nakamura

TL;DR
This paper introduces a dynamic, bandwidth-aware state transfer method for geo-replicated state machines that significantly reduces transfer time and improves replica replacement efficiency in variable network conditions.
Contribution
It presents a novel bandwidth-adaptive state transfer approach that dynamically allocates state chunks based on current network bandwidths, enhancing efficiency over existing methods.
Findings
Reduces state transfer time by up to 47% in experiments.
Effectively adapts to changing bandwidths for optimized data transfer.
Improves replica replacement speed, mitigating latency issues.
Abstract
This paper proposes a new state transfer method for geographic state machine replication (SMR) that dynamically allocates the state to be transferred among replicas according to changes in communication bandwidths. SMR improves fault tolerance by replicating a service to multiple replicas. When a replica is newly added or recovered from a failure, the other replicas transfer the current state of the service to it. However, in geographic SMR, the communication bandwidths of replicas are different and constantly changing. Therefore, existing state transfer methods cannot fully utilize the available bandwidth, and their state transfer time increases. To overcome this problem, our method divides the state into multiple chunks and assigns them to replicas based on each replica's bandwidth so that the broader a replica's bandwidth is, the more chunks it transfers. The proposed method also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Caching and Content Delivery · IoT and Edge/Fog Computing
