An Overlay Multicast Routing Method Based on Network Situational Awareness and Hierarchical Multi-Agent Reinforcement Learning

Miao Ye; Yanye Chen; Yong Wang; Cheng Zhu; Qiuxiang Jiang; Gai Huang; Feng Ding

arXiv:2602.13211·cs.NI·April 24, 2026

An Overlay Multicast Routing Method Based on Network Situational Awareness and Hierarchical Multi-Agent Reinforcement Learning

Miao Ye, Yanye Chen, Yong Wang, Cheng Zhu, Qiuxiang Jiang, Gai Huang, Feng Ding

PDF

TL;DR

This paper introduces MA-DHRL-OM, a hierarchical reinforcement learning-based overlay multicast routing method that improves adaptability, stability, and performance in dynamic networks by leveraging SDN's global view.

Contribution

It presents a novel multi-agent deep hierarchical reinforcement learning approach that decomposes multicast routing into stages, enhancing scalability and convergence stability.

Findings

01

Outperforms existing methods in delay, bandwidth utilization, and packet loss.

02

Achieves more stable convergence and flexible routing.

03

Utilizes SDN's global view for traffic-aware path planning.

Abstract

Compared with IP multicast, Overlay Multicast (OM) offers better compatibility and flexible deployment in heterogeneous, cross-domain networks. However, traditional OM struggles to adapt to dynamic traffic due to unawareness of physical resource states, and existing reinforcement learning methods fail to decouple OM's tightly coupled multi-objective nature, leading to high complexity, slow convergence, and instability. To address this, we propose MA-DHRL-OM, a multi-agent deep hierarchical reinforcement learning approach. Using SDN's global view, it builds a traffic-aware model for OM path planning. The method decomposes OM tree construction into two stages via hierarchical agents, reducing action space and improving convergence stability. Multi-agent collaboration balances multi-objective optimization while enhancing scalability and adaptability. Experiments show MA-DHRL-OM outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.