Towards Generalization and Simplicity in Continuous Control

Aravind Rajeswaran; Kendall Lowrey; Emanuel Todorov; Sham Kakade

arXiv:1703.02660 · cs.LG · March 21, 2018 · 26 cites

TL;DR

This paper demonstrates that simple linear and RBF policies can solve continuous control tasks with performance competitive with more elaborate neural network policies. It also shows that training with a diverse initial state distribution yields policies that generalize better and recover from large perturbations.

Contribution

It shows that simple policy parameterizations can match more elaborate models such as fully connected neural networks in continuous control, and that training with a diverse initial state distribution is important for generalization.

Findings

01 Simple policies perform competitively on benchmarks.

02 Diverse initial states improve policy generalization.

03 Global policies recover from large perturbations.
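Finding 02 can be illustrated with a minimal sketch of what widening the initial state distribution means in practice. This is not the authors' code: the function name `sample_initial_states`, the uniform noise model, and the `spread` parameter are hypothetical choices made only for illustration.

```python
import numpy as np


def sample_initial_states(nominal_state, spread, num_samples, seed=0):
    """Sample start states uniformly in a box around a nominal state.

    Hypothetical helper: `spread` is the half-width of the box per dimension.
    A small spread mimics a narrow benchmark-style reset; a large spread
    mimics a more diverse initial state distribution.
    """
    rng = np.random.default_rng(seed)
    nominal_state = np.asarray(nominal_state, dtype=float)
    noise = rng.uniform(-spread, spread, size=(num_samples, nominal_state.size))
    return nominal_state + noise


# Narrow resets: every episode starts almost at the same state.
narrow = sample_initial_states(np.zeros(4), spread=0.01, num_samples=1000)

# Diverse resets: episodes start across a much larger region of state space.
wide = sample_initial_states(np.zeros(4), spread=1.0, num_samples=1000)
```

With narrow resets, a policy only ever sees states near one trajectory, which is the trajectory-centric regime the abstract warns about. Wider resets force it to act sensibly across more of the state space.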

Abstract

This work shows that policies with simple linear and RBF parameterizations can be trained to solve a variety of continuous control tasks, including the OpenAI gym benchmarks. The performance of these trained policies is competitive with state-of-the-art results obtained with more elaborate parameterizations such as fully connected neural networks. Furthermore, existing training and testing scenarios are shown to be very limited and prone to over-fitting, thus giving rise to only trajectory-centric policies. Training with a diverse initial state distribution is shown to produce more global policies with better generalization. This allows for interactive control scenarios where the system recovers from large on-line perturbations, as shown in the supplementary video.
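The two parameterizations named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation. It assumes a Gaussian policy whose mean is affine in a feature vector, and it assumes the RBF features are approximated with random Fourier features (a sinusoid of a random projection of the observation). The class names, the `bandwidth` parameter, the feature count, and the initial `log_std` are all hypothetical, and no training procedure is shown.

```python
import numpy as np


class LinearPolicy:
    """Gaussian policy whose mean is an affine function of the features."""

    def __init__(self, feat_dim, act_dim, log_std=-0.5, seed=0):
        self.rng = np.random.default_rng(seed)
        self.W = np.zeros((act_dim, feat_dim))    # trainable weights
        self.b = np.zeros(act_dim)                # trainable bias
        self.log_std = np.full(act_dim, log_std)  # per-action log std (assumed value)

    def features(self, obs):
        # Linear policy: the features are the raw observation.
        return np.asarray(obs, dtype=float)

    def mean(self, obs):
        return self.W @ self.features(obs) + self.b

    def act(self, obs):
        # Sample an action from N(mean, diag(exp(log_std))^2).
        mu = self.mean(obs)
        return mu + np.exp(self.log_std) * self.rng.standard_normal(mu.shape)


class RBFPolicy(LinearPolicy):
    """Same affine map, applied to random Fourier features of the observation."""

    def __init__(self, obs_dim, act_dim, num_features=100, bandwidth=1.0, seed=0):
        super().__init__(num_features, act_dim, seed=seed)
        # Fixed (non-trainable) random projection and phase shifts.
        self.P = self.rng.standard_normal((num_features, obs_dim))
        self.phi = self.rng.uniform(-np.pi, np.pi, size=num_features)
        self.bandwidth = bandwidth

    def features(self, obs):
        obs = np.asarray(obs, dtype=float)
        return np.sin(self.P @ obs / self.bandwidth + self.phi)
```

In both cases only `W`, `b`, and `log_std` would be trained, so the parameter count stays small compared with a fully connected network. The RBF variant adds nonlinearity through fixed random features rather than learned hidden layers.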

Code & Models

Repositories

khansel01/nes-npg (PyTorch)

Taxonomy

Topics: Reinforcement Learning in Robotics · Model Reduction and Neural Networks · Adaptive Dynamic Programming Control