A Stable On-line Algorithm for Energy Efficient Multi-user Scheduling
Nitin Salodkar, Abhay Karandikar, V.S. Borkar

TL;DR
This paper introduces a reinforcement learning-based online algorithm for energy-efficient multi-user uplink scheduling that adapts to unknown channel and arrival statistics while satisfying delay constraints.
Contribution
It proposes a novel distributed reinforcement learning approach for multi-user scheduling under delay constraints without prior statistical knowledge.
Findings
Algorithm converges and is stable.
Achieves energy efficiency while meeting delay constraints.
Effective in IEEE 802.16 system simulations.
Abstract
In this paper, we consider the problem of energy efficient uplink scheduling with delay constraint for a multi-user wireless system. We address this problem within the framework of constrained Markov decision processes (CMDPs) wherein one seeks to minimize one cost (average power) subject to a hard constraint on another (average delay). We do not assume the arrival and channel statistics to be known. To handle state space explosion and informational constraints, we split the problem into individual CMDPs for the users, coupled through their Lagrange multipliers; and a user selection problem at the base station. To address the issue of unknown channel and arrival statistics, we propose a reinforcement learning algorithm. The users use this learning algorithm to determine the rate at which they wish to transmit in a slot and communicate this to the base station. The base station then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Wireless Network Optimization · Advanced MIMO Systems Optimization · Cooperative Communication and Network Coding
