Intelligent Proactive Fault Tolerance at the Edge through Resource Usage Prediction
Theodoros Theodoropoulos, John Violos, Stylianos Tsanakas, Aris, Leivadeas, Konstantinos Tserpes, Theodora Varvarigou

TL;DR
This paper introduces an intelligent proactive fault tolerance system for edge computing that uses deep learning to predict resource usage, enabling preemptive actions to maintain QoS in dynamic environments.
Contribution
It presents a novel hybrid Bayesian evolution strategy for model adaptation and a composite deep learning architecture for resource prediction at the edge.
Findings
Significant reduction in prediction errors (RMSE and MAE) compared to state-of-the-art methods.
Enhanced fault tolerance performance with improved reliability and maintainability in simulations.
Effective proactive node replication and task migration based on predicted resource usage.
Abstract
The proliferation of demanding applications and edge computing establishes the need for an efficient management of the underlying computing infrastructures, urging the providers to rethink their operational methods. In this paper, we propose an Intelligent Proactive Fault Tolerance (IPFT) method that leverages the edge resource usage predictions through Recurrent Neural Networks (RNN). More specifically, we focus on the process-faults, which are related with the inability of the infrastructure to provide Quality of Service (QoS) in acceptable ranges due to the lack of processing power. In order to tackle this challenge we propose a composite deep learning architecture that predicts the resource usage metrics of the edge nodes and triggers proactive node replications and task migration. Taking also into consideration that the edge computing infrastructure is also highly dynamic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
