FRL-FI: Transient Fault Analysis for Federated Reinforcement   Learning-Based Navigation Systems

Zishen Wan; Aqeel Anwar; Abdulrahman Mahmoud; Tianyu Jia; Yu-Shun; Hsiao; Vijay Janapa Reddi; Arijit Raychowdhury

arXiv:2203.07276·cs.LG·March 15, 2022·1 cites

FRL-FI: Transient Fault Analysis for Federated Reinforcement Learning-Based Navigation Systems

Zishen Wan, Aqeel Anwar, Abdulrahman Mahmoud, Tianyu Jia, Yu-Shun, Hsiao, Vijay Janapa Reddi, Arijit Raychowdhury

PDF

Open Access

TL;DR

This paper evaluates the fault tolerance of federated reinforcement learning navigation systems and proposes two efficient fault detection methods that significantly improve resilience with minimal overhead.

Contribution

It introduces a comprehensive fault analysis for FRL navigation systems and presents two novel, cost-effective fault detection and recovery techniques.

Findings

01

Fault tolerance varies with fault models and system parameters.

02

Proposed techniques achieve up to 3.3x resilience improvement.

03

Overhead of the methods is less than 2.7%.

Abstract

Swarm intelligence is being increasingly deployed in autonomous systems, such as drones and unmanned vehicles. Federated reinforcement learning (FRL), a key swarm intelligence paradigm where agents interact with their own environments and cooperatively learn a consensus policy while preserving privacy, has recently shown potential advantages and gained popularity. However, transient faults are increasing in the hardware system with continuous technology node scaling and can pose threats to FRL systems. Meanwhile, conventional redundancy-based protection methods are challenging to deploy on resource-constrained edge applications. In this paper, we experimentally evaluate the fault tolerance of FRL navigation systems at various scales with respect to fault models, fault locations, learning algorithms, layer types, communication intervals, and data types at both training and inference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Age of Information Optimization · Adversarial Robustness in Machine Learning