Cost-Driven Data Replication with Predictions

Tianyu Zuo; Xueyan Tang; Bu Sung Lee

arXiv:2404.16489·cs.DS·April 26, 2024

Cost-Driven Data Replication with Predictions

Tianyu Zuo, Xueyan Tang, Bu Sung Lee

PDF

TL;DR

This paper introduces a cost-efficient online data replication algorithm that leverages simple predictions to adaptively manage data copies across servers, balancing storage and network costs in dynamic environments.

Contribution

It develops a learning-augmented online algorithm with proven competitiveness bounds and analyzes the impact of prediction errors on performance.

Findings

01

Algorithm achieves ($rac{5+ ext{α}}{3}$)-competitiveness with perfect predictions.

02

Algorithm maintains bounded robustness under prediction errors.

03

Experimental results validate the effectiveness of the proposed approach.

Abstract

This paper studies an online replication problem for distributed data access. The goal is to dynamically create and delete data copies in a multi-server system as time passes to minimize the total storage and network cost of serving access requests. We study the problem in the emergent learning-augmented setting, assuming simple binary predictions about inter-request times at individual servers. We develop an online algorithm and prove that it is ( $\frac{5 + α}{3}$ )-consistent (competitiveness under perfect predictions) and ( $1 + \frac{1}{α}$ )-robust (competitiveness under terrible predictions), where $α \in (0, 1]$ is a hyper-parameter representing the level of distrust in the predictions. We also study the impact of mispredictions on the competitive ratio of the proposed algorithm and adapt it to achieve a bounded robustness while retaining its consistency. We further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.