Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems

Hao Liang; Shuqing Shi; Yudi Zhang; Biwei Huang; Yali Du

arXiv:2510.21427·cs.LG·October 27, 2025

Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems

Hao Liang, Shuqing Shi, Yudi Zhang, Biwei Huang, Yali Du

PDF

1 Video

TL;DR

This paper introduces GSAC, a framework combining causal representation learning with meta actor-critic methods to enable scalable, generalizable policy learning in large networked systems with environment shifts.

Contribution

The paper proposes a novel GSAC framework that learns sparse local causal masks and compact domain factors, providing provable guarantees and efficient adaptation for networked systems.

Findings

01

GSAC achieves rapid adaptation to new domains with few trajectories.

02

The method outperforms learning-from-scratch and baseline approaches.

03

Finite-sample guarantees are established for causal recovery and policy convergence.

Abstract

Large-scale networked systems, such as traffic, power, and wireless grids, challenge reinforcement-learning agents with both scale and environment shifts. To address these challenges, we propose GSAC (Generalizable and Scalable Actor-Critic), a framework that couples causal representation learning with meta actor-critic learning to achieve both scalability and domain generalization. Each agent first learns a sparse local causal mask that provably identifies the minimal neighborhood variables influencing its dynamics, yielding exponentially tight approximately compact representations (ACRs) of state and domain factors. These ACRs bound the error of truncating value functions to $κ$ -hop neighborhoods, enabling efficient learning on graphs. A meta actor-critic then trains a shared policy across multiple source domains while conditioning on the compact domain factors; at test time, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems· slideslive