Sample-efficient Reinforcement Learning in Robotic Table Tennis

Jonas Tebbe; Lukas Krauch; Yapeng Gao; Andreas Zell

arXiv:2011.03275·cs.RO·January 5, 2024

Sample-efficient Reinforcement Learning in Robotic Table Tennis

Jonas Tebbe, Lukas Krauch, Yapeng Gao, Andreas Zell

PDF

TL;DR

This paper introduces a sample-efficient reinforcement learning algorithm tailored for robotic table tennis, achieving effective learning within 200 episodes by leveraging a one-step environment and actor-critic methods.

Contribution

The paper presents a novel, sample-efficient RL approach for robotic table tennis that operates with minimal trials by embedding the method into the robot system and using a one-step environment.

Findings

01

Achieves competitive performance in simulation and real robot scenarios.

02

Learns accurate return strategies without pre-training in under 200 episodes.

03

Demonstrates effectiveness in high-dimensional, continuous state spaces.

Abstract

Reinforcement learning (RL) has achieved some impressive recent successes in various computer games and simulations. Most of these successes are based on having large numbers of episodes from which the agent can learn. In typical robotic applications, however, the number of feasible attempts is very limited. In this paper we present a sample-efficient RL algorithm applied to the example of a table tennis robot. In table tennis every stroke is different, with varying placement, speed and spin. An accurate return therefore has to be found depending on a high-dimensional continuous state space. To make learning in few trials possible the method is embedded into our robot system. In this way we can use a one-step environment. The state space depends on the ball at hitting time (position, velocity, spin) and the action is the racket state (orientation, velocity) at hitting. An actor-critic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.