Robotic Table Tennis with Model-Free Reinforcement Learning

Wenbo Gao; Laura Graesser; Krzysztof Choromanski; Xingyou; Song; Nevena Lazic; Pannag Sanketi; Vikas Sindhwani; Navdeep; Jaitly

arXiv:2003.14398·cs.LG·May 29, 2020·5 cites

Robotic Table Tennis with Model-Free Reinforcement Learning

Wenbo Gao, Laura Graesser, Krzysztof Choromanski, Xingyou, Song, Nevena Lazic, Pannag Sanketi, Vikas Sindhwani, Navdeep, Jaitly

PDF

Open Access

TL;DR

This paper introduces a model-free reinforcement learning approach for robotic table tennis, achieving high return rates and multi-modal stroke styles without architectural priors.

Contribution

It demonstrates that evolutionary search with CNN-based policies can learn smooth, multi-modal control strategies for fast robotic table tennis without prior architectural constraints.

Findings

01

Achieved 80% return rate on diverse ball throws

02

Developed multi-modal forehand and backhand strokes

03

Learned smooth, efficient policies at 100Hz control rate

Abstract

We propose a model-free algorithm for learning efficient policies capable of returning table tennis balls by controlling robot joints at a rate of 100Hz. We demonstrate that evolutionary search (ES) methods acting on CNN-based policy architectures for non-visual inputs and convolving across time learn compact controllers leading to smooth motions. Furthermore, we show that with appropriately tuned curriculum learning on the task and rewards, policies are capable of developing multi-modal styles, specifically forehand and backhand stroke, whilst achieving 80\% return rate on a wide range of ball throws. We observe that multi-modality does not require any architectural priors, such as multi-head architectures or hierarchical policies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms