FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations
Marie Siew, Shikhar Sharma, Zekai Li, Kun Guo, Chao Xu, Tania Lorido-Botran, Tony Q.S. Quek, Carlee Joe-Wong

TL;DR
FIRE is a reinforcement learning framework designed for edge computing that adapts to rare server failures by training in a digital twin environment, improving migration cost efficiency during failures.
Contribution
It introduces ImRE, an importance sampling-based Q-learning algorithm, and scalable deep RL variants, to effectively handle rare failure events in edge computing migrations.
Findings
FIRE reduces migration costs compared to baseline methods.
ImRE converges to optimality with boundedness guarantees.
Framework accommodates users with different risk tolerances.
Abstract
In edge computing, users' service profiles are migrated due to user mobility. Reinforcement learning (RL) frameworks have been proposed to do so, often trained on simulated data. However, existing RL frameworks overlook occasional server failures, which although rare, impact latency-sensitive applications like autonomous driving and real-time obstacle detection. Nevertheless, these failures (rare events), being not adequately represented in historical training data, pose a challenge for data-driven RL algorithms. As it is impractical to adjust failure frequency in real-world applications for training, we introduce FIRE, a framework that adapts to rare events by training a RL policy in an edge computing digital twin environment. We propose ImRE, an importance sampling-based Q-learning algorithm, which samples rare events proportionally to their impact on the value function. FIRE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · IoT and Edge/Fog Computing · Privacy-Preserving Technologies in Data
Methodstravel james · Q-Learning
