Cost-Driven Data Replication with Predictions
Tianyu Zuo, Xueyan Tang, Bu Sung Lee

TL;DR
This paper introduces a cost-efficient online data replication algorithm that leverages simple predictions to adaptively manage data copies across servers, balancing storage and network costs in dynamic environments.
Contribution
It develops a learning-augmented online algorithm with proven competitiveness bounds and analyzes the impact of prediction errors on performance.
Findings
Algorithm achieves ($rac{5+ ext{α}}{3}$)-competitiveness with perfect predictions.
Algorithm maintains bounded robustness under prediction errors.
Experimental results validate the effectiveness of the proposed approach.
Abstract
This paper studies an online replication problem for distributed data access. The goal is to dynamically create and delete data copies in a multi-server system as time passes to minimize the total storage and network cost of serving access requests. We study the problem in the emergent learning-augmented setting, assuming simple binary predictions about inter-request times at individual servers. We develop an online algorithm and prove that it is ()-consistent (competitiveness under perfect predictions) and ()-robust (competitiveness under terrible predictions), where is a hyper-parameter representing the level of distrust in the predictions. We also study the impact of mispredictions on the competitive ratio of the proposed algorithm and adapt it to achieve a bounded robustness while retaining its consistency. We further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
