# Reinforcement Learning for Self-Organization and Power Control of   Two-Tier Heterogeneous Networks

**Authors:** Roohollah Amiri, Mojtaba Ahmadi Almasi, Jeffrey G. Andrews, Hani, Mehrpouyan

arXiv: 1812.09778 · 2019-03-19

## TL;DR

This paper introduces a reinforcement learning-based framework for self-organizing power control in dense two-tier heterogeneous networks, enabling adaptive interference management and quality of service maintenance.

## Contribution

It proposes a distributed multi-agent Markov decision process model and a Q-learning algorithm for autonomous power optimization in HetNets.

## Key findings

- Q-DPA algorithm achieves near-optimal power control with high probability.
- The framework maintains macrocell user quality of service at high femtocell densities.
- Sample complexity bounds ensure efficient learning in dense deployments.

## Abstract

Self-organizing networks (SONs) can help manage the severe interference in dense heterogeneous networks (HetNets). Given their need to automatically configure power and other settings, machine learning is a promising tool for data-driven decision making in SONs. In this paper, a HetNet is modeled as a dense two-tier network with conventional macrocells overlaid with denser small cells (e.g. femto or pico cells). First, a distributed framework based on multi-agent Markov decision process is proposed that models the power optimization problem in the network. Second, we present a systematic approach for designing a reward function based on the optimization problem. Third, we introduce Q-learning based distributed power allocation algorithm (Q-DPA) as a self-organizing mechanism that enables ongoing transmit power adaptation as new small cells are added to the network. Further, the sample complexity of the Q-DPA algorithm to achieve $\epsilon$-optimality with high probability is provided. We demonstrate, at density of several thousands femtocells per km$^2$, the required quality of service of a macrocell user can be maintained via the proper selection of independent or cooperative learning and appropriate Markov state models.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.09778/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1812.09778/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/1812.09778/full.md

---
Source: https://tomesphere.com/paper/1812.09778