Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot
Tao Huang, Kai Chen, Wang Wei, Jianan Li, Yonghao Long, Qi Dou

TL;DR
This paper introduces ViSkill, a reinforcement learning framework that improves long-horizon surgical robot task performance by selecting optimal termination states for subtasks based on a value function, enhancing policy connection and success.
Contribution
The work proposes a novel value-informed skill chaining method that effectively guides subtask termination for better policy connection in surgical robot tasks.
Findings
Achieves high success rates on complex surgical tasks.
Improves policy stability and execution efficiency.
Demonstrates effectiveness on SurRoL simulation platform.
Abstract
Reinforcement learning is still struggling with solving long-horizon surgical robot tasks which involve multiple steps over an extended duration of time due to the policy exploration challenge. Recent methods try to tackle this problem by skill chaining, in which the long-horizon task is decomposed into multiple subtasks for easing the exploration burden and subtask policies are temporally connected to complete the whole long-horizon task. However, smoothly connecting all subtask policies is difficult for surgical robot scenarios. Not all states are equally suitable for connecting two adjacent subtasks. An undesired terminate state of the previous subtask would make the current subtask policy unstable and result in a failed execution. In this work, we introduce value-informed skill chaining (ViSkill), a novel reinforcement learning framework for long-horizon surgical robot tasks. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Aortic aneurysm repair treatments · Artificial Intelligence in Healthcare and Education
