Primitive Skill-based Robot Learning from Human Evaluative Feedback

Ayano Hiranaka; Minjune Hwang; Sharon Lee; Chen Wang; Li Fei-Fei,; Jiajun Wu; Ruohan Zhang

arXiv:2307.15801·cs.RO·August 3, 2023·1 cites

Primitive Skill-based Robot Learning from Human Evaluative Feedback

Ayano Hiranaka, Minjune Hwang, Sharon Lee, Chen Wang, Li Fei-Fei,, Jiajun Wu, Ruohan Zhang

PDF

Open Access

TL;DR

The paper introduces SEED, a novel framework combining primitive skill-based reinforcement learning and human feedback to improve sample efficiency, safety, and reduce human effort in robot manipulation tasks.

Contribution

SEED is the first framework to integrate primitive skills with RLHF, enhancing safety and efficiency in long-horizon robot learning.

Findings

01

SEED outperforms state-of-the-art RL algorithms in sample efficiency.

02

SEED significantly reduces human effort compared to existing RLHF methods.

03

SEED demonstrates superior safety and effectiveness across five manipulation tasks.

Abstract

Reinforcement learning (RL) algorithms face significant challenges when dealing with long-horizon robot manipulation tasks in real-world environments due to sample inefficiency and safety issues. To overcome these challenges, we propose a novel framework, SEED, which leverages two approaches: reinforcement learning from human feedback (RLHF) and primitive skill-based reinforcement learning. Both approaches are particularly effective in addressing sparse reward issues and the complexities involved in long-horizon tasks. By combining them, SEED reduces the human effort required in RLHF and increases safety in training robot manipulation with RL in real-world settings. Additionally, parameterized skills provide a clear view of the agent's high-level intentions, allowing humans to evaluate skill choices before they are executed. This feature makes the training process even safer and more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · EEG and Brain-Computer Interfaces