Spatial-Temporal Reinforcement Learning for Network Routing with Non-Markovian Traffic

Molly Wang; Kin.K Leung

arXiv:2507.22174·cs.LG·August 1, 2025

Spatial-Temporal Reinforcement Learning for Network Routing with Non-Markovian Traffic

Molly Wang, Kin.K Leung

PDF

Open Access

TL;DR

This paper introduces a spatial-temporal reinforcement learning framework for network routing that effectively handles non-Markovian traffic patterns and exploits network topology structure, outperforming traditional methods.

Contribution

The paper presents a novel STRL framework that models non-Markovian traffic and spatial network features, improving routing performance over existing RL approaches.

Findings

01

Achieves over 19% improvement during training.

02

Attains more than 7% better inference accuracy.

03

Handles non-Markovian traffic effectively.

Abstract

Reinforcement Learning (RL) has been widely used for packet routing in communication networks, but traditional RL methods rely on the Markov assumption that the current state contains all necessary information for decision-making. In reality, internet traffic is non-Markovian, and past states do influence routing performance. Moreover, common deep RL approaches use function approximators, such as neural networks, that do not model the spatial structure in network topologies. To address these shortcomings, we design a network environment with non-Markovian traffic and introduce a spatial-temporal RL (STRL) framework for packet routing. Our approach outperforms traditional baselines by more than 19% during training and 7% for inference despite a change in network topology.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Energy Efficient Wireless Sensor Networks · Advanced MIMO Systems Optimization