Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang, Kai Chen, Bin Li, Yun-Hui Liu, Qi Dou

TL;DR
This paper introduces DEX, a demonstration-guided reinforcement learning algorithm that enhances exploration efficiency and success rates in surgical robot automation by effectively utilizing expert demonstrations, validated through simulation and real robot experiments.
Contribution
The paper presents a novel RL method that leverages expert demonstrations with higher value estimation and non-parametric regression to improve exploration in surgical automation tasks.
Findings
Significant improvement in exploration efficiency and success rates in simulation tasks.
Effective transfer of learned policies from simulation to real surgical robot.
Demonstrated robustness across 10 diverse surgical manipulation tasks.
Abstract
Task automation of surgical robot has the potentials to improve surgical efficiency. Recent reinforcement learning (RL) based approaches provide scalable solutions to surgical automation, but typically require extensive data collection to solve a task if no prior knowledge is given. This issue is known as the exploration challenge, which can be alleviated by providing expert demonstrations to an RL agent. Yet, how to make effective use of demonstration data to improve exploration efficiency still remains an open challenge. In this work, we introduce Demonstration-guided EXploration (DEX), an efficient reinforcement learning algorithm that aims to overcome the exploration problem with expert demonstrations for surgical automation. To effectively exploit demonstrations, our method estimates expert-like behaviors with higher values to facilitate productive interactions, and adopts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Surgical Simulation and Training
