Reinforcement Learning for Vision-based Object Manipulation with   Non-parametric Policy and Action Primitives

Dongwon Son; Myungsin Kim; Jaecheol Sim; Wonsik Shin

arXiv:2206.05671·cs.RO·June 22, 2022

Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action Primitives

Dongwon Son, Myungsin Kim, Jaecheol Sim, Wonsik Shin

PDF

TL;DR

This paper introduces AP-NPQL, a novel reinforcement learning framework utilizing non-parametric policies and action primitives, which improves efficiency and success rates in vision-based object manipulation tasks and transfers well to real robots.

Contribution

The paper presents a new non-parametric Q-learning approach with action primitives for efficient visual object manipulation, outperforming existing parametric methods.

Findings

01

Outperforms state-of-the-art algorithms in simulation tasks.

02

Achieves higher learning efficiency and success rates.

03

Successfully transfers policies from simulation to real robots.

Abstract

The object manipulation is a crucial ability for a service robot, but it is hard to solve with reinforcement learning due to some reasons such as sample efficiency. In this paper, to tackle this object manipulation, we propose a novel framework, AP-NPQL (Non-Parametric Q Learning with Action Primitives), that can efficiently solve the object manipulation with visual input and sparse reward, by utilizing a non-parametric policy for reinforcement learning and appropriate behavior prior for the object manipulation. We evaluate the efficiency and the performance of the proposed AP-NPQL for four object manipulation tasks on simulation (pushing plate, stacking box, flipping cup, and picking and placing plate), and it turns out that our AP-NPQL outperforms the state-of-the-art algorithms based on parametric policy and behavior prior in terms of learning time and task success rate. We also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.