Reinforcement learning for pursuit and evasion of microswimmers at low   Reynolds number

Francesco Borra; Luca Biferale; Massimo Cencini; Antonio; Celani

arXiv:2106.08609·physics.flu-dyn·March 4, 2022

Reinforcement learning for pursuit and evasion of microswimmers at low Reynolds number

Francesco Borra, Luca Biferale, Massimo Cencini, Antonio, Celani

PDF

TL;DR

This paper models pursuit-evasion between microswimmers in a low-Reynolds-number fluid environment, demonstrating that adversarial reinforcement learning enables agents to develop complex strategies that outperform heuristics despite partial information.

Contribution

It introduces a reinforcement learning framework for pursuit-evasion in microswimmers, revealing how agents learn sophisticated tactics in a hydrodynamic setting with partial observability.

Findings

01

Agents outperform heuristic strategies.

02

Reinforcement learning discovers complex pursuit-evasion tactics.

03

Agents effectively handle partial information through learned behaviors.

Abstract

We consider a model of two competing microswimming agents engaged in a pursue-evasion task within a low-Reynolds-number environment. Agents can only perform simple maneuvers and sense hydrodynamic disturbances, which provide ambiguous (partial) information about the opponent's position and motion. We frame the problem as a zero-sum game: The pursuer has to capture the evader in the shortest time, while the evader aims at deferring capture as long as possible. We show that the agents, trained via adversarial reinforcement learning, are able to overcome partial observability by discovering increasingly complex sequences of moves and countermoves that outperform known heuristic strategies and exploit the hydrodynamic environment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.