An Improved Multiple Faults Reassignment based Recovery in Cluster Computing
Sanjay Bansal, Sanjeev Sharma

TL;DR
This paper proposes a two-phase recovery scheme for cluster computing that efficiently handles multiple node failures by dynamically redistributing loads, improving performance and resource utilization compared to existing single-fault methods.
Contribution
It introduces a novel two-phase recovery approach with dynamic load redistribution and failure node reassignment algorithms for multiple fault tolerance in cluster computing.
Findings
Improved load balancing reduces recovery time.
Enhanced resource utilization during multiple node failures.
Lower communication overhead compared to traditional methods.
Abstract
In case of multiple node failures performance becomes very low as compare to single node failure. Failures of nodes in cluster computing can be tolerated by multiple fault tolerant computing. Existing recovery schemes are efficient for single fault but not with multiple faults. Recovery scheme proposed in this paper having two phases; sequentially phase, concurrent phase. In sequentially phase, loads of all working nodes are uniformly and evenly distributed by proposed dynamic rank based and load distribution algorithm. In concurrent phase, loads of all failure nodes as well as new job arrival are assigned equally to all available nodes by just finding the least loaded node among the several nodes by failure nodes job allocation algorithm. Sequential and concurrent executions of algorithms improve the performance as well better resource utilization. Dynamic rank based algorithm for load…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPeer-to-Peer Network Technologies · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems
