Decentralized Shepherding of Non-Cohesive Swarms Through Cluttered Environments via Deep Reinforcement Learning

Cristiana Punzo; Italo Napolitano; Cinzia Tomaselli; Mario di Bernardo

arXiv:2511.21405·eess.SY·November 27, 2025

Decentralized Shepherding of Non-Cohesive Swarms Through Cluttered Environments via Deep Reinforcement Learning

Cristiana Punzo, Italo Napolitano, Cinzia Tomaselli, Mario di Bernardo

PDF

Open Access

TL;DR

This paper presents a decentralized reinforcement learning approach for shepherding non-cohesive swarms through cluttered environments, enabling scalable and collision-free navigation without retraining in complex scenarios.

Contribution

It introduces a hierarchical control architecture combining target assignment with a learned low-level steering policy that generalizes from single-target to multi-target environments.

Findings

01

Smooth, collision-free trajectories achieved in simulations

02

Method generalizes to multiple obstacles without retraining

03

Consistent convergence to goal regions demonstrated

Abstract

This paper investigates decentralized shepherding in cluttered environments, where a limited number of herders must guide a larger group of non-cohesive, diffusive targets toward a goal region in the presence of static obstacles. A hierarchical control architecture is proposed, integrating a high-level target assignment rule, where each herder is paired with a selected target, with a learning-based low-level driving module that enables effective steering of the assigned target. The low-level policy is trained in a one-herder-one-target scenario with a rectangular obstacle using Proximal Policy Optimization and then directly extended to multi-agent settings with multiple obstacles without requiring retraining. Numerical simulations demonstrate smooth, collision-free trajectories and consistent convergence to the goal region, highlighting the potential of reinforcement learning for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Reinforcement Learning in Robotics · Guidance and Control Systems