Dynamic Resource Configuration for Low-Power IoT Networks: A Multi-Objective Reinforcement Learning Method
Yang Huang, Caiyong Hao, Yijie Mao, and Fuhui Zhou

TL;DR
This paper introduces a multi-objective reinforcement learning approach for dynamic resource configuration in low-power IoT networks, significantly improving energy efficiency and decision accuracy under interference conditions.
Contribution
It proposes a novel MORL-based scheme for resource configuration, addressing limitations of single-objective methods in interference-rich environments.
Findings
Decision error rate reduced to less than 12% of conventional methods
MORL scheme outperforms R-learning in energy efficiency
Enhanced adaptability to interference in IoT networks
Abstract
Considering grant-free transmissions in low-power IoT networks with unknown time-frequency distribution of interference, we address the problem of Dynamic Resource Configuration (DRC), which amounts to a Markov decision process. Unfortunately, off-the-shelf methods based on single-objective reinforcement learning cannot guarantee energy-efficient transmission, especially when all frequency-domain channels in a time interval are interfered. Therefore, we propose a novel DRC scheme where configuration policies are optimized with a Multi-Objective Reinforcement Learning (MORL) framework. Numerical results show that the average decision error rate achieved by the MORL-based DRC can be even less than 12% of that yielded by the conventional R-learning-based approach.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
