Towards Safe Control of Continuum Manipulator Using Shielded Multiagent   Reinforcement Learning

Guanglin Ji; Junyan Yan; Jingxin Du; Wanquan Yan; Jibiao Chen,; Yongkang Lu; Juan Rojas; and Shing Shin Cheng

arXiv:2106.07892·cs.RO·June 16, 2021·5 cites

Towards Safe Control of Continuum Manipulator Using Shielded Multiagent Reinforcement Learning

Guanglin Ji, Junyan Yan, Jingxin Du, Wanquan Yan, Jibiao Chen,, Yongkang Lu, Juan Rojas, and Shing Shin Cheng

PDF

Open Access

TL;DR

This paper demonstrates that a shielded multiagent deep Q network can safely and accurately control continuum surgical robots under complex interaction scenarios, improving control performance without explicit modeling.

Contribution

It introduces a shielded MADQN framework for safe, model-free control of continuum robots, addressing nonlinearities and external interactions effectively.

Findings

01

Achieved submillimeter trajectory tracking accuracy.

02

Enabled safe control under external loads and collisions.

03

Validated on both 2-DoF and miniature continuum robots.

Abstract

Continuum robotic manipulators are increasingly adopted in minimal invasive surgery. However, their nonlinear behavior is challenging to model accurately, especially when subject to external interaction, potentially leading to poor control performance. In this letter, we investigate the feasibility of adopting a model-free multiagent reinforcement learning (RL), namely multiagent deep Q network (MADQN), to control a 2-degree of freedom (DoF) cable-driven continuum surgical manipulator. The control of the robot is formulated as a one-DoF, one agent problem in the MADQN framework to improve the learning efficiency. Combined with a shielding scheme that enables dynamic variation of the action set boundary, MADQN leads to efficient and importantly safer control of the robot. Shielded MADQN enabled the robot to perform point and trajectory tracking with submillimeter root mean square errors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoft Robotics and Applications · Prosthetics and Rehabilitation Robotics · Robot Manipulation and Learning