Learning swimming via deep reinforcement learning
Jin Zhang, Lei Zhou, Bochao Cao

TL;DR
This paper combines variational autoencoders and reinforcement learning to discover efficient flapping motions for underwater propulsion, demonstrating harmonic motions as optimal for energy-efficient swimming.
Contribution
It introduces a novel RL framework with motion compression via VAE, enabling autonomous discovery of efficient swimming patterns without predefined motion constraints.
Findings
Harmonic motions are optimal for hydrodynamic efficiency.
RL converges to harmonic-like motion patterns after training.
Proposed framework can extend to other complex swimming problems.
Abstract
For decades, people have been seeking for fishlike flapping motions that can realize underwater propulsion with low energy cost. Complexity of the nonstationary flow field around the flapping body makes this problem very difficult. In earlier studies, motion patterns are usually prescribed as certain periodic functions which constrains the following optimization process in a small subdomain of the whole motion space. In this work, to avoid this motion constraint, a variational autoencoder (VAE) is designed to compress various flapping motions into a simple action vector. Then we let a flapping airfoil continuously interact with water tunnel environment and adjust its action accordingly through a reinforcement learning (RL) framework. By this automatic close-looped experiment, we obtain several motion patterns that can result in high hydrodynamic efficiency comparing to pure harmonic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomimetic flight and propulsion mechanisms · Model Reduction and Neural Networks · Reinforcement Learning in Robotics
