A critical assessment of reinforcement learning methods for microswimmer navigation in complex flows

Selim Mecanna; Aurore Loisy; Christophe Eloy

arXiv:2505.05525·cs.LG·October 28, 2025

A critical assessment of reinforcement learning methods for microswimmer navigation in complex flows

Selim Mecanna, Aurore Loisy, Christophe Eloy

PDF

1 Repo

TL;DR

This paper critically evaluates reinforcement learning algorithms for microswimmer navigation in complex fluid flows, demonstrating that advanced methods like PPO outperform traditional algorithms and can achieve near-optimal performance with proper tuning.

Contribution

It provides a comprehensive assessment of RL methods in fluid navigation, highlighting the superiority of PPO and emphasizing the importance of implementation details and hyperparameter tuning.

Findings

01

PPO outperforms Q-Learning and Advantage Actor Critic in complex flows.

02

Proper tuning and techniques enable PPO to match theoretical optimal performance.

03

Traditional RL algorithms show poor robustness in turbulent flow environments.

Abstract

Navigating in a fluid flow while being carried by it, using only information accessible from on-board sensors, is a problem commonly faced by small planktonic organisms. It is also directly relevant to autonomous robots deployed in the oceans. In the last ten years, the fluid mechanics community has widely adopted reinforcement learning, often in the form of its simplest implementations, to address this challenge. But it is unclear how good are the strategies learned by these algorithms. In this paper, we perform a quantitative assessment of reinforcement learning methods applied to navigation in partially observable flows. We first introduce a well-posed problem of directional navigation for which a quasi-optimal policy is known analytically. We then report on the poor performance and robustness of commonly used algorithms (Q-Learning, Advantage Actor Critic) in flows regularly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

c0pep0d/rlfl0w
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsEntropy Regularization · Proximal Policy Optimization