Latency Optimization for Resource Allocation in Cloud Computing System
Masoud Nosrati, Abdolah Chalechale, and Ronak Karimi

TL;DR
This paper surveys resource allocation in cloud computing, then proposes an latency-aware optimization method that improves response time and resource detection, especially as task numbers increase.
Contribution
It introduces a latency-based resource allocation method with a history table and probability matrix, enhancing cloud system performance and fault detection capabilities.
Findings
Improved response time with increased tasks
Effective detection of unavailable resources through latency measurement
Enhanced support for migration, replication, and fault tolerance
Abstract
Recent studies in different fields of science caused emergence of needs for high performance computing systems like Cloud. A critical issue in design and implementation of such systems is resource allocation which is directly affected by internal and external factors like the number of nodes, geographical distance and communication latencies. Many optimizations took place in resource allocation methods in order to achieve better performance by concentrating on computing, network and energy resources. Communication latencies as a limitation of network resources have always been playing an important role in parallel processing (especially in fine-grained programs). In this paper, we are going to have a survey on the resource allocation issue in Cloud and then do an optimization on common resource allocation method based on the latencies of communications. Due to it, we added a table to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
