Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection
Tianming Wang, Wenjie Lu, Huan Yu, Dikai Liu

TL;DR
This paper introduces a transfer learning framework with transition mismatch compensation for underwater robots to effectively reject excessive disturbances despite model mismatches, improving stability and sample efficiency.
Contribution
It proposes a modular transfer learning approach combining GCP and ODI, along with a novel TMC algorithm to adapt policies for disturbance rejection under dynamics mismatch.
Findings
Successfully rejects disturbances in simulation
Improves sample efficiency in training
Enhances stability of underwater robot control
Abstract
Underwater robots in shallow waters usually suffer from strong wave forces, which may frequently exceed robot's control constraints. Learning-based controllers are suitable for disturbance rejection control, but the excessive disturbances heavily affect the state transition in Markov Decision Process (MDP) or Partially Observable Markov Decision Process (POMDP). Also, pure learning procedures on targeted system may encounter damaging exploratory actions or unpredictable system variations, and training exclusively on a prior model usually cannot address model mismatch from the targeted system. In this paper, we propose a transfer learning framework that adapts a control policy for excessive disturbance rejection of an underwater robot under dynamics model mismatch. A modular network of learning policies is applied, composed of a Generalized Control Policy (GCP) and an Online Disturbance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Underwater Vehicles and Communication Systems · Domain Adaptation and Few-Shot Learning
