Predator-prey survival pressure is sufficient to evolve swarming behaviors
Jianan Li, Liang Li, Shiyu Zhao

TL;DR
This study demonstrates that simple survival-based rewards in a predator-prey reinforcement learning framework can lead to diverse emergent swarming and dispersal behaviors, offering insights into biological collective behavior and potential robotic applications.
Contribution
Introduces a minimal predator-prey coevolution model using survival pressure-based rewards, revealing diverse emergent behaviors without handcrafted rules.
Findings
Prey exhibit flocking and swirling behaviors.
Predators develop dispersion and confusion tactics.
Emergent behaviors arise solely from survival-based rewards.
Abstract
The comprehension of how local interactions arise in global collective behavior is of utmost importance in both biological and physical research. Traditional agent-based models often rely on static rules that fail to capture the dynamic strategies of the biological world. Reinforcement learning has been proposed as a solution, but most previous methods adopt handcrafted reward functions that implicitly or explicitly encourage the emergence of swarming behaviors. In this study, we propose a minimal predator-prey coevolution framework based on mixed cooperative-competitive multiagent reinforcement learning, and adopt a reward function that is solely based on the fundamental survival pressure, that is, prey receive a reward of if caught by predators while predators receive a reward of . Surprisingly, our analysis of this approach reveals an unexpectedly rich diversity of emergent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodsfail
