Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks   with Surgical Robot

Tao Huang; Kai Chen; Wang Wei; Jianan Li; Yonghao Long; Qi Dou

arXiv:2307.16503·cs.RO·August 1, 2023

Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot

Tao Huang, Kai Chen, Wang Wei, Jianan Li, Yonghao Long, Qi Dou

PDF

Open Access 1 Repo

TL;DR

This paper introduces ViSkill, a reinforcement learning framework that improves long-horizon surgical robot task performance by selecting optimal termination states for subtasks based on a value function, enhancing policy connection and success.

Contribution

The work proposes a novel value-informed skill chaining method that effectively guides subtask termination for better policy connection in surgical robot tasks.

Findings

01

Achieves high success rates on complex surgical tasks.

02

Improves policy stability and execution efficiency.

03

Demonstrates effectiveness on SurRoL simulation platform.

Abstract

Reinforcement learning is still struggling with solving long-horizon surgical robot tasks which involve multiple steps over an extended duration of time due to the policy exploration challenge. Recent methods try to tackle this problem by skill chaining, in which the long-horizon task is decomposed into multiple subtasks for easing the exploration burden and subtask policies are temporally connected to complete the whole long-horizon task. However, smoothly connecting all subtask policies is difficult for surgical robot scenarios. Not all states are equally suitable for connecting two adjacent subtasks. An undesired terminate state of the previous subtask would make the current subtask policy unstable and result in a failed execution. In this work, we introduce value-informed skill chaining (ViSkill), a novel reinforcement learning framework for long-horizon surgical robot tasks. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

med-air/viskill
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSurgical Simulation and Training · Aortic aneurysm repair treatments · Artificial Intelligence in Healthcare and Education