# Deep Multi-Agent Reinforcement Learning Based Cooperative Edge Caching   in Wireless Networks

**Authors:** Chen Zhong, M. Cenk Gursoy, and Senem Velipasalar

arXiv: 1905.05256 · 2019-05-15

## TL;DR

This paper introduces a deep multi-agent reinforcement learning framework for cooperative edge caching in wireless networks, aiming to reduce transmission delay by dynamically learning optimal caching policies at base stations.

## Contribution

It presents a novel deep actor-critic multi-agent approach for edge caching with unknown content popularity, outperforming traditional caching policies in delay reduction.

## Key findings

- Significant delay reduction compared to LRU, LFU, FIFO policies.
- Demonstrates adaptability to changing network environments.
- Outperforms traditional caching algorithms in simulations.

## Abstract

The growing demand on high-quality and low-latency multimedia services has led to much interest in edge caching techniques. Motivated by this, we in this paper consider edge caching at the base stations with unknown content popularity distributions. To solve the dynamic control problem of making caching decisions, we propose a deep actor-critic reinforcement learning based multi-agent framework with the aim to minimize the overall average transmission delay. To evaluate the proposed framework, we compare the learning-based performance with three other caching policies, namely least recently used (LRU), least frequently used (LFU), and first-in-first-out (FIFO) policies. Through simulation results, performance improvements of the proposed framework over these three caching algorithms have been identified and its superior ability to adapt to varying environments is demonstrated.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.05256/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1905.05256/full.md

## References

15 references — full list in the complete paper: https://tomesphere.com/paper/1905.05256/full.md

---
Source: https://tomesphere.com/paper/1905.05256