On Throughput-Delay Optimal Access to Storage Clouds via Load Adaptive Coding and Chunking
Guanfeng Liang, Ulas C. Kozat

TL;DR
This paper introduces TOFEC, an adaptive coding and chunking scheme for storage clouds that optimizes throughput and delay by dynamically adjusting to load conditions, significantly improving performance over static strategies.
Contribution
It presents the first comprehensive adaptive approach combining coding rate and chunking for storage cloud performance optimization, validated through measurement and queueing analysis.
Findings
TOFEC adapts to load, achieving optimal throughput-delay trade-off.
TOFEC reduces latency by 2.5x under light workloads.
TOFEC supports over 3x more requests compared to latency-optimized strategies.
Abstract
Recent literature including our past work provide analysis and solutions for using (i) erasure coding, (ii) parallelism, or (iii) variable slicing/chunking (i.e., dividing an object of a specific size into a variable number of smaller chunks) in speeding the I/O performance of storage clouds. However, a comprehensive approach that considers all three dimensions together to achieve the best throughput-delay trade-off curve had been lacking. This paper presents the first set of solutions that can pick the best combination of coding rate and object chunking/slicing options as the load dynamically changes. Our specific contributions are as follows: (1) We establish via measurement that combining variable coding rate and chunking is mostly feasible over a popular public cloud. (2) We relate the delay optimal values for chunking level and code rate to the queue backlogs via an approximate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed systems and fault tolerance · Cloud Computing and Resource Management
