Learning Efficient Navigation in Vortical Flow Fields
Peter Gunnarson, Ioannis Mandralis, Guido Novati, Petros Koumoutsakos,, John O. Dabiri

TL;DR
This paper presents a reinforcement learning approach enabling robots to efficiently navigate unsteady flow fields by exploiting environmental cues, achieving near-optimal success rates and time efficiency in complex ocean-like scenarios.
Contribution
The study introduces a novel RL algorithm that uses environmental cues and experience replay to learn navigation policies in unsteady flow fields, outperforming bio-mimetic sensing methods.
Findings
Velocity sensing outperforms vorticity sensing by nearly two-fold in success rate.
Reinforcement learning achieves near 100% success in reaching targets.
Navigation policies approach the efficiency of global optimal control paths.
Abstract
Efficient point-to-point navigation in the presence of a background flow field is important for robotic applications such as ocean surveying. In such applications, robots may only have knowledge of their immediate surroundings or be faced with time-varying currents, which limits the use of optimal control techniques for planning trajectories. Here, we apply a novel Reinforcement Learning algorithm to discover time-efficient navigation policies to steer a fixed-speed swimmer through an unsteady two-dimensional flow field. The algorithm entails inputting environmental cues into a deep neural network that determines the swimmer's actions, and deploying Remember and Forget Experience replay. We find that the resulting swimmers successfully exploit the background flow to reach the target, but that this success depends on the type of sensed environmental cue. Surprisingly, a velocity sensing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
