Survivable Hyper-Redundant Robotic Arm with Bayesian Policy Morphing

Sayyed Jaffar Ali Raza; Apan Dastider; Mingjie Lin

arXiv:2010.09964·cs.RO·October 21, 2020

Survivable Hyper-Redundant Robotic Arm with Bayesian Policy Morphing

Sayyed Jaffar Ali Raza, Apan Dastider, Mingjie Lin

PDF

TL;DR

This paper introduces a Bayesian reinforcement learning framework called Bayesian Policy Morphing that enables robotic arms to adaptively recover from mechanical failures by self-modifying their policies, enhancing survivability.

Contribution

The paper proposes a novel Bayesian Policy Morphing method that extends actor-critic algorithms for adaptive policy recovery in damaged robotic manipulators.

Findings

01

Robotic arm maintains functionality despite joint damages.

02

Bayesian Policy Morphing improves learning efficiency with fewer samples.

03

Successful demonstration on an 8-DOF robotic arm with various joint damages.

Abstract

In this paper we present a Bayesian reinforcement learning framework that allows robotic manipulators to adaptively recover from random mechanical failures autonomously, hence being survivable. To this end, we formulate the framework of Bayesian Policy Morphing (BPM) that enables a robot agent to self-modify its learned policy after the diminution of its maneuvering dimensionality. We build upon existing actor-critic framework, and extend it to perform policy gradient updates as posterior learning, taking past policy updates as prior distributions. We show that policy search, in the direction biased by prior experience, significantly improves learning efficiency in terms of sampling requirements. We demonstrate our results on an 8-DOF robotic arm with our algorithm of BPM, while intentionally disabling random joints with different damage types like unresponsive joints, constant offset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.