Hierarchical Learning-Based Control for Multi-Agent Shepherding of Stochastic Autonomous Agents
Italo Napolitano, Stefano Covone, Andrea Lama, Francesco De Lellis, Mario di Bernardo

TL;DR
This paper introduces a hierarchical, learning-based control system for multi-agent shepherding that effectively guides stochastic autonomous targets without relying on explicit communication, demonstrating superior performance and scalability.
Contribution
It proposes a novel decentralized control architecture that learns from experience and adapts to stochastic target behavior without prior knowledge or centralized supervision.
Findings
Achieves 100% success rate in guiding targets
Improves settling times and control efficiency
Demonstrates scalability and real-time implementation on Robotarium
Abstract
Multi-agent shepherding represents a challenging distributed control problem where herder agents must coordinate to guide independently moving targets to desired spatial configurations. Most existing control strategies assume cohesive target behavior, which frequently fails in practical applications where targets exhibit stochastic autonomous behavior. This paper presents a hierarchical learning-based control architecture that decomposes the shepherding problem into a high-level decision-making module and a low-level motion control component. The proposed distributed control system synthesizes effective control policies directly from closed-loop experience without requiring explicit inter-agent communication or prior knowledge of target dynamics. The decentralized architecture achieves cooperative control behavior through emergent coordination without centralized supervision.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control
