Transformer-Based Scalable Multi-Agent Reinforcement Learning for Networked Systems with Long-Range Interactions

Vidur Sinha; Muhammed Ustaomeroglu; Guannan Qu

arXiv:2511.13103·cs.LG·November 18, 2025

Transformer-Based Scalable Multi-Agent Reinforcement Learning for Networked Systems with Long-Range Interactions

Vidur Sinha, Muhammed Ustaomeroglu, Guannan Qu

PDF

Open Access

TL;DR

This paper introduces STACCA, a transformer-based multi-agent reinforcement learning framework that effectively models long-range dependencies and generalizes across different network topologies for large-scale network control tasks.

Contribution

STACCA is a novel transformer-based MARL framework that captures long-range interactions and generalizes across network structures, addressing key limitations of prior methods.

Findings

01

STACCA outperforms existing methods in epidemic containment tasks.

02

It demonstrates strong generalization across diverse network topologies.

03

The framework scales effectively to large networked systems.

Abstract

Multi-agent reinforcement learning (MARL) has shown promise for large-scale network control, yet existing methods face two major limitations. First, they typically rely on assumptions leading to decay properties of local agent interactions, limiting their ability to capture long-range dependencies such as cascading power failures or epidemic outbreaks. Second, most approaches lack generalizability across network topologies, requiring retraining when applied to new graphs. We introduce STACCA (Shared Transformer Actor-Critic with Counterfactual Advantage), a unified transformer-based MARL framework that addresses both challenges. STACCA employs a centralized Graph Transformer Critic to model long-range dependencies and provide system-level feedback, while its shared Graph Transformer Actor learns a generalizable policy capable of adapting across diverse network structures. Further, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Graph Neural Networks · Software-Defined Networks and 5G