Centralized & Distributed Deep Reinforcement Learning Methods for   Downlink Sum-Rate Optimization

Ahmad Ali Khan; Raviraj Adve

arXiv:2009.03033·cs.IT·September 15, 2020

Centralized & Distributed Deep Reinforcement Learning Methods for Downlink Sum-Rate Optimization

Ahmad Ali Khan, Raviraj Adve

PDF

Open Access

TL;DR

This paper introduces centralized and distributed deep reinforcement learning methods for optimizing downlink sum-rate in multi-cell networks, achieving higher efficiency and faster execution than traditional algorithms.

Contribution

It presents novel trust region-based DRL algorithms for multi-agent and single-agent settings, improving convergence and enabling distributed optimization with limited CSI.

Findings

01

Higher spectral efficiency than WMMSE and FP algorithms

02

Execution times over 100 times faster than traditional methods

03

Decentralized DRL approaches with limited CSI perform competitively

Abstract

For a multi-cell, multi-user, cellular network downlink sum-rate maximization through power allocation is a nonconvex and NP-hard optimization problem. In this paper, we present an effective approach to solving this problem through single- and multi-agent actor-critic deep reinforcement learning (DRL). Specifically, we use finite-horizon trust region optimization. Through extensive simulations, we show that we can simultaneously achieve higher spectral efficiency than state-of-the-art optimization algorithms like weighted minimum mean-squared error (WMMSE) and fractional programming (FP), while offering execution times more than two orders of magnitude faster than these approaches. Additionally, the proposed trust region methods demonstrate superior performance and convergence properties than the Advantage Actor-Critic (A2C) DRL algorithm. In contrast to prior approaches, the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Grid Security and Resilience · Viral Infections and Vectors · Adaptive Dynamic Programming Control