Multi-Access Communications with Energy Harvesting: A Multi-Armed Bandit Model and the Optimality of the Myopic Policy
Pol Blasco, Deniz Gunduz

TL;DR
This paper models multi-access wireless networks with energy harvesting nodes as a restless multi-armed bandit problem, proving the optimality of a myopic scheduling policy under certain conditions and analyzing its performance.
Contribution
It formulates the scheduling problem as an RMAB, derives an upper bound, and proves the optimality of the myopic policy in specific scenarios.
Findings
Myopic policy is optimal under certain assumptions.
An upper bound on the optimal scheduling policy is derived.
Numerical results show the performance of the myopic policy relative to the bound.
Abstract
A multi-access wireless network with N transmitting nodes, each equipped with an energy harvesting (EH) device and a rechargeable battery of finite capacity, is studied. At each time slot (TS) a node is operative with a certain probability, which may depend on the availability of data, or the state of its channel. The energy arrival process at each node is modelled as an independent two-state Markov process, such that, at each TS, a node either harvests one unit of energy, or none. At each TS a subset of the nodes is scheduled by the access point (AP). The scheduling policy that maximises the total throughput is studied assuming that the AP does not know the states of either the EH processes or the batteries. The problem is identified as a restless multiarmed bandit (RMAB) problem, and an upper bound on the optimal scheduling policy is found. Under certain assumptions regarding the EH…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Harvesting in Wireless Networks · Advanced MIMO Systems Optimization · Age of Information Optimization
MethodsSpatio-temporal stability analysis
