Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in   Edge Industrial IoT

Wanlu Lei; Yu Ye; Ming Xiao; Mikael Skoglund; Zhu Han

arXiv:2107.00481·cs.LG·July 2, 2021·1 cites

Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT

Wanlu Lei, Yu Ye, Ming Xiao, Mikael Skoglund, Zhu Han

PDF

Open Access

TL;DR

This paper introduces an adaptive stochastic incremental ADMM algorithm tailored for decentralized reinforcement learning in edge-based IIoT networks, addressing communication efficiency, scalability, and adaptability challenges.

Contribution

It proposes a novel adaptive stochastic ADMM method with proven convergence for decentralized RL, enhancing performance in complex, heterogeneous IoT environments.

Findings

01

Outperforms existing methods in communication efficiency

02

Demonstrates scalability in large IoT networks

03

Adapts effectively to heterogeneous agent environments

Abstract

Edge computing provides a promising paradigm to support the implementation of Industrial Internet of Things (IIoT) by offloading tasks to nearby edge nodes. Meanwhile, the increasing network size makes it impractical for centralized data processing due to limited bandwidth, and consequently a decentralized learning scheme is preferable. Reinforcement learning (RL) has been widely investigated and shown to be a promising solution for decision-making and optimal control processes. For RL in a decentralized setup, edge nodes (agents) connected through a communication network aim to work collaboratively to find a policy to optimize the global reward as the sum of local rewards. However, communication costs, scalability and adaptation in complex environments with heterogeneous agents may significantly limit the performance of decentralized RL. Alternating direction method of multipliers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Energy Harvesting in Wireless Networks · Advanced MIMO Systems Optimization

MethodsAlternating Direction Method of Multipliers