Modular Transfer Learning with Transition Mismatch Compensation for   Excessive Disturbance Rejection

Tianming Wang; Wenjie Lu; Huan Yu; Dikai Liu

arXiv:2007.14646·cs.RO·July 30, 2020

Modular Transfer Learning with Transition Mismatch Compensation for Excessive Disturbance Rejection

Tianming Wang, Wenjie Lu, Huan Yu, Dikai Liu

PDF

Open Access

TL;DR

This paper introduces a transfer learning framework with transition mismatch compensation for underwater robots to effectively reject excessive disturbances despite model mismatches, improving stability and sample efficiency.

Contribution

It proposes a modular transfer learning approach combining GCP and ODI, along with a novel TMC algorithm to adapt policies for disturbance rejection under dynamics mismatch.

Findings

01

Successfully rejects disturbances in simulation

02

Improves sample efficiency in training

03

Enhances stability of underwater robot control

Abstract

Underwater robots in shallow waters usually suffer from strong wave forces, which may frequently exceed robot's control constraints. Learning-based controllers are suitable for disturbance rejection control, but the excessive disturbances heavily affect the state transition in Markov Decision Process (MDP) or Partially Observable Markov Decision Process (POMDP). Also, pure learning procedures on targeted system may encounter damaging exploratory actions or unpredictable system variations, and training exclusively on a prior model usually cannot address model mismatch from the targeted system. In this paper, we propose a transfer learning framework that adapts a control policy for excessive disturbance rejection of an underwater robot under dynamics model mismatch. A modular network of learning policies is applied, composed of a Generalized Control Policy (GCP) and an Online Disturbance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Underwater Vehicles and Communication Systems · Domain Adaptation and Few-Shot Learning