Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT
Wanlu Lei, Yu Ye, Ming Xiao, Mikael Skoglund, Zhu Han

TL;DR
This paper introduces an adaptive stochastic incremental ADMM algorithm tailored for decentralized reinforcement learning in edge-based IIoT networks, addressing communication efficiency, scalability, and adaptability challenges.
Contribution
It proposes a novel adaptive stochastic ADMM method with proven convergence for decentralized RL, enhancing performance in complex, heterogeneous IoT environments.
Findings
Outperforms existing methods in communication efficiency
Demonstrates scalability in large IoT networks
Adapts effectively to heterogeneous agent environments
Abstract
Edge computing provides a promising paradigm to support the implementation of Industrial Internet of Things (IIoT) by offloading tasks to nearby edge nodes. Meanwhile, the increasing network size makes it impractical for centralized data processing due to limited bandwidth, and consequently a decentralized learning scheme is preferable. Reinforcement learning (RL) has been widely investigated and shown to be a promising solution for decision-making and optimal control processes. For RL in a decentralized setup, edge nodes (agents) connected through a communication network aim to work collaboratively to find a policy to optimize the global reward as the sum of local rewards. However, communication costs, scalability and adaptation in complex environments with heterogeneous agents may significantly limit the performance of decentralized RL. Alternating direction method of multipliers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Energy Harvesting in Wireless Networks · Advanced MIMO Systems Optimization
MethodsAlternating Direction Method of Multipliers
