Coexistence between Task- and Data-Oriented Communications: A Whittle's   Index Guided Multi-Agent Reinforcement Learning Approach

Ran Li; Chuan Huang; Xiaoqi Qin; Shengpei Jiang; Nan Ma; and Shuguang; Cui

arXiv:2205.09377·cs.IT·May 20, 2022

Coexistence between Task- and Data-Oriented Communications: A Whittle's Index Guided Multi-Agent Reinforcement Learning Approach

Ran Li, Chuan Huang, Xiaoqi Qin, Shengpei Jiang, Nan Ma, and Shuguang, Cui

PDF

Open Access

TL;DR

This paper introduces a novel multi-agent reinforcement learning approach guided by Whittle's index to optimize coexistence of task- and data-oriented communications in IoT systems, effectively managing shared channels and improving performance metrics.

Contribution

It develops a Whittle's index guided multi-agent proximal policy optimization algorithm for joint scheduling, addressing large action spaces and time-varying constraints in IoT communication systems.

Findings

01

The proposed WI-MAPPO algorithm outperforms existing AoI-based methods.

02

The approach effectively manages shared channels in resource-constrained IoT scenarios.

03

Numerical results demonstrate significant performance improvements.

Abstract

We investigate the coexistence of task-oriented and data-oriented communications in a IoT system that shares a group of channels, and study the scheduling problem to jointly optimize the weighted age of incorrect information (AoII) and throughput, which are the performance metrics of the two types of communications, respectively. This problem is formulated as a Markov decision problem, which is difficult to solve due to the large discrete action space and the time-varying action constraints induced by the stochastic availability of channels. By exploiting the intrinsic properties of this problem and reformulating the reward function based on channel statistics, we first simplify the solution space, state space, and optimality criteria, and convert it to an equivalent Markov game, for which the large discrete action space issue is greatly relieved. Then, we propose a Whittle's index…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · IoT and Edge/Fog Computing · Cognitive Functions and Memory