FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations

Marie Siew; Shikhar Sharma; Zekai Li; Kun Guo; Chao Xu; Tania Lorido-Botran; Tony Q.S. Quek; Carlee Joe-Wong

arXiv:2209.14399·cs.NI·October 21, 2025·1 cites

FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations

Marie Siew, Shikhar Sharma, Zekai Li, Kun Guo, Chao Xu, Tania Lorido-Botran, Tony Q.S. Quek, Carlee Joe-Wong

PDF

Open Access

TL;DR

FIRE is a reinforcement learning framework designed for edge computing that adapts to rare server failures by training in a digital twin environment, improving migration cost efficiency during failures.

Contribution

It introduces ImRE, an importance sampling-based Q-learning algorithm, and scalable deep RL variants, to effectively handle rare failure events in edge computing migrations.

Findings

01

FIRE reduces migration costs compared to baseline methods.

02

ImRE converges to optimality with boundedness guarantees.

03

Framework accommodates users with different risk tolerances.

Abstract

In edge computing, users' service profiles are migrated due to user mobility. Reinforcement learning (RL) frameworks have been proposed to do so, often trained on simulated data. However, existing RL frameworks overlook occasional server failures, which although rare, impact latency-sensitive applications like autonomous driving and real-time obstacle detection. Nevertheless, these failures (rare events), being not adequately represented in historical training data, pose a challenge for data-driven RL algorithms. As it is impractical to adjust failure frequency in real-world applications for training, we introduce FIRE, a framework that adapts to rare events by training a RL policy in an edge computing digital twin environment. We propose ImRE, an importance sampling-based Q-learning algorithm, which samples rare events proportionally to their impact on the value function. FIRE…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · IoT and Edge/Fog Computing · Privacy-Preserving Technologies in Data

Methodstravel james · Q-Learning