Improving Grid Computing Performance by Optimally Reducing Checkpointing Effect
Garba Aliyu, Kana A. F. D., Abdullahi Mohammed, Idris Abdulmumin,, Shehu Adamu, Fatsuma Jauro

TL;DR
This paper presents an improved checkpointing system for grid computing that reduces runtime overhead by adding resource replicas and replicating checkpoint files, leading to performance gains in simulation.
Contribution
The paper introduces a novel checkpointing approach with resource replication to lower overhead and improve grid computing performance.
Findings
Up to 11% improvement in makespan
Up to 9% increase in throughput
Up to 11% reduction in turnaround time
Abstract
Grid computing is a collection of computer resources that are gathered together from various areas to give computational resources such as storage, data or application services. This is to permit clients to access this huge measure of processing resources without the need to know where these might be found and what technology such as, hardware equipment and operating system was used. Dependability and performance are among the key difficulties faced in a grid computing environment. Various systems have been proposed in the literature to handle recouping from resource failure in Grid computing environment. One case of such system is checkpointing. Checkpointing is a system that endures faults when resources failed. Checkpointing method has the upside of lessening the work lost because of resource faults. However, checkpointing presents a huge runtime overhead. In this paper, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Parallel Computing and Optimization Techniques · Advanced Data Storage Technologies
