Online Learning Algorithms for Stochastic Water-Filling
Yi Gai, Bhaskar Krishnamachari

TL;DR
This paper introduces online learning algorithms for stochastic water-filling, enabling power allocation in time-varying channels without prior knowledge, and demonstrates their asymptotic optimality and convergence properties.
Contribution
It proposes two novel algorithms, CWF1 and CWF2, for stochastic water-filling, addressing unknown channel distributions and exploiting non-linear dependencies, respectively.
Findings
CWF1 achieves polynomial regret growth in channels and logarithmic in time.
CWF2 bounds incorrect allocations, ensuring they diminish over time.
Algorithms asymptotically reach optimal rate without prior channel knowledge.
Abstract
Water-filling is the term for the classic solution to the problem of allocating constrained power to a set of parallel channels to maximize the total data-rate. It is used widely in practice, for example, for power allocation to sub-carriers in multi-user OFDM systems such as WiMax. The classic water-filling algorithm is deterministic and requires perfect knowledge of the channel gain to noise ratios. In this paper we consider how to do power allocation over stochastically time-varying (i.i.d.) channels with unknown gain to noise ratio distributions. We adopt an online learning framework based on stochastic multi-armed bandits. We consider two variations of the problem, one in which the goal is to find a power allocation to maximize , and another in which the goal is to find a power allocation to maximize $\sum\limits_i \log(1 +…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Advanced Wireless Network Optimization
