TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Giuseppe Paolo, Abdelhakim Benechehab, Hamza Cherkaoui, Albert Thomas,, Bal\'azs K\'egl

TL;DR
This paper introduces TAG, a decentralized framework for multi-agent hierarchical reinforcement learning that supports arbitrary hierarchy depth, improving scalability and performance over traditional methods.
Contribution
The paper presents TAG, a novel framework enabling fully decentralized, multi-level hierarchical reinforcement learning with a standardized interface and flexible agent integration.
Findings
TAG improves learning speed and final performance.
Hierarchical architectures outperform classical multi-agent RL baselines.
Decentralized organization enhances scalability and adaptability.
Abstract
Hierarchical organization is fundamental to biological systems and human societies, yet artificial intelligence systems often rely on monolithic architectures that limit adaptability and scalability. Current hierarchical reinforcement learning (HRL) approaches typically restrict hierarchies to two levels or require centralized training, which limits their practical applicability. We introduce TAME Agent Framework (TAG), a framework for constructing fully decentralized hierarchical multi-agent systems. TAG enables hierarchies of arbitrary depth through a novel LevelEnv concept, which abstracts each hierarchy level as the environment for the agents above it. This approach standardizes information flow between levels while preserving loose coupling, allowing for seamless integration of diverse agent types. We demonstrate the effectiveness of TAG by implementing hierarchical architectures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSupply Chain and Inventory Management · Reinforcement Learning in Robotics
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
