Towards a Sample Efficient Reinforcement Learning Pipeline for Vision Based Robotics
Maxence Mahe, Pierre Belamri, Jesus Bujalance Martin

TL;DR
This paper proposes an efficient reinforcement learning pipeline for vision-based robotic control, combining computer vision for relevant feature extraction with faster training methods to enable a robotic arm to reach a target quickly.
Contribution
It introduces a streamlined pipeline integrating computer vision and accelerated reinforcement learning for efficient robotic arm training from RGB video.
Findings
Reduced training time for robotic arm reaching tasks
Effective extraction of relevant visual information from RGB videos
Demonstrated faster policy learning in a real-world robotic setup
Abstract
Deep Reinforcement learning holds the guarantee of empowering self-ruling robots to master enormous collections of conduct abilities with negligible human mediation. The improvements brought by this technique enables robots to perform difficult tasks such as grabbing or reaching targets. Nevertheless, the training process is still time consuming and tedious especially when learning policies only with RGB camera information. This way of learning is capital to transfer the task from simulation to the real world since the only external source of information for the robot in real life is video. In this paper, we study how to limit the time taken for training a robotic arm with 6 Degrees Of Freedom (DOF) to reach a ball from scratch by assembling a pipeline as efficient as possible. The pipeline is divided into two parts: the first one is to capture the relevant information from the RGB…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
