Collaborative Policy Learning for Dynamic Scheduling Tasks in   Cloud-Edge-Terminal IoT Networks Using Federated Reinforcement Learning

Do-Yup Kim; Da-Eun Lee; Ji-Wan Kim; Hyun-Suk Lee

arXiv:2307.00541·cs.LG·July 4, 2023

Collaborative Policy Learning for Dynamic Scheduling Tasks in Cloud-Edge-Terminal IoT Networks Using Federated Reinforcement Learning

Do-Yup Kim, Da-Eun Lee, Ji-Wan Kim, Hyun-Suk Lee

PDF

Open Access

TL;DR

This paper introduces a federated reinforcement learning framework for collaborative dynamic scheduling in cloud-edge-terminal IoT networks, improving learning efficiency and fairness across tasks and edges.

Contribution

It proposes a novel adaptive collaborative policy learning framework with an edge-agnostic policy structure for IoT networks, along with convergence analysis.

Findings

01

Accelerates policy learning speed.

02

Enhances adaptation for new edges.

03

Outperforms non-collaborative approaches.

Abstract

In this paper, we examine cloud-edge-terminal IoT networks, where edges undertake a range of typical dynamic scheduling tasks. In these IoT networks, a central policy for each task can be constructed at a cloud server. The central policy can be then used by the edges conducting the task, thereby mitigating the need for them to learn their own policy from scratch. Furthermore, this central policy can be collaboratively learned at the cloud server by aggregating local experiences from the edges, thanks to the hierarchical architecture of the IoT networks. To this end, we propose a novel collaborative policy learning framework for dynamic scheduling tasks using federated reinforcement learning. For effective learning, our framework adaptively selects the tasks for collaborative learning in each round, taking into account the need for fairness among tasks. In addition, as a key enabler of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing · Age of Information Optimization · Privacy-Preserving Technologies in Data

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings