Predator-prey survival pressure is sufficient to evolve swarming behaviors

Jianan Li; Liang Li; Shiyu Zhao

arXiv:2308.12624 · q-bio.PE · August 25, 2023

TL;DR

This study demonstrates that simple survival-based rewards in a predator-prey reinforcement learning framework can lead to diverse emergent swarming and dispersal behaviors, offering insights into biological collective behavior and potential robotic applications.

Contribution

Introduces a minimal predator-prey coevolution model using survival pressure-based rewards, revealing diverse emergent behaviors without handcrafted rules.

Findings

1. Prey exhibit flocking and swirling behaviors.

2. Predators develop dispersion and confusion tactics.

3. Emergent behaviors arise solely from survival-based rewards.

Abstract

The comprehension of how local interactions arise in global collective behavior is of utmost importance in both biological and physical research. Traditional agent-based models often rely on static rules that fail to capture the dynamic strategies of the biological world. Reinforcement learning has been proposed as a solution, but most previous methods adopt handcrafted reward functions that implicitly or explicitly encourage the emergence of swarming behaviors. In this study, we propose a minimal predator-prey coevolution framework based on mixed cooperative-competitive multiagent reinforcement learning, and adopt a reward function that is solely based on the fundamental survival pressure, that is, prey receive a reward of $-1$ if caught by predators while predators receive a reward of $+1$. Surprisingly, our analysis of this approach reveals an unexpectedly rich diversity of emergent…
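The survival-pressure reward described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `survival_rewards`, the 2D point positions, the `capture_radius` threshold, and the rule that the predator within that radius is the one credited with the catch are all assumptions made for illustration. Only the reward values ($-1$ for a caught prey, $+1$ for predators) come from the abstract.

```python
import math


def survival_rewards(prey, predators, capture_radius=1.0):
    """Return (prey_rewards, predator_rewards) for a single time step.

    prey, predators: lists of (x, y) positions.
    capture_radius: hypothetical catch distance (an assumption, not a
    value taken from the paper).
    """
    prey_rewards = [0.0] * len(prey)
    predator_rewards = [0.0] * len(predators)
    for i, (px, py) in enumerate(prey):
        for j, (qx, qy) in enumerate(predators):
            # Assumption: a catch happens when a predator is within
            # capture_radius of a prey.
            if math.hypot(px - qx, py - qy) <= capture_radius:
                prey_rewards[i] = -1.0      # caught prey receives -1
                predator_rewards[j] = 1.0   # catching predator receives +1
    return prey_rewards, predator_rewards


# Hypothetical positions: the first prey is within reach of the first predator.
prey_positions = [(0.0, 0.0), (5.0, 5.0)]
predator_positions = [(0.5, 0.0), (10.0, 10.0)]
prey_r, pred_r = survival_rewards(prey_positions, predator_positions)
print(prey_r, pred_r)
```

Note that no term in this sketch rewards cohesion, alignment, or any other group-level quantity. Agents that are not involved in a catch simply receive zero, which is the sense in which the reward is based solely on survival pressure.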
