TL;DR
This paper evaluates various deep reinforcement learning algorithms for vision-based robotic grasping using a simulated benchmark, highlighting the effectiveness of simple methods and analyzing stability and generalization to unseen objects.
Contribution
It introduces a simulated benchmark for off-policy learning in robotic grasping and compares multiple algorithms, including a novel Monte Carlo and off-policy correction approach.
Findings
Simple methods perform competitively with complex algorithms.
Off-policy learning enhances generalization to unseen objects.
Stability analysis reveals tradeoffs among algorithms.
Abstract
In this paper, we explore deep reinforcement learning algorithms for vision-based robotic grasping. Model-free deep reinforcement learning (RL) has been successfully applied to a range of challenging environments, but the proliferation of algorithms makes it difficult to discern which particular approach would be best suited for a rich, diverse task like grasping. To answer this question, we propose a simulated benchmark for robotic grasping that emphasizes off-policy learning and generalization to unseen objects. Off-policy learning enables utilization of grasping data over a wide variety of objects, and diversity is important to enable the method to generalize to new objects that were not seen during training. We evaluate the benchmark tasks against a variety of Q-function estimation methods, a method previously proposed for robotic grasping with deep neural network models, and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
