Structured Control Nets for Deep Reinforcement Learning

Mario Srouji; Jian Zhang; Ruslan Salakhutdinov

arXiv:1802.08311·cs.LG·March 13, 2020·6 cites

Structured Control Nets for Deep Reinforcement Learning

Mario Srouji, Jian Zhang, Ruslan Salakhutdinov

PDF

Open Access 1 Repo

TL;DR

This paper introduces Structured Control Nets, a neural network architecture for deep reinforcement learning that combines linear and nonlinear modules to improve training efficiency, reward, and generalization across various control tasks.

Contribution

The paper proposes a novel neural network architecture, Structured Control Net, that splits the policy network into linear and nonlinear modules to enhance performance and efficiency in reinforcement learning.

Findings

01

Achieved competitive results on MuJoCo, Roboschool, Atari, and urban driving environments.

02

Demonstrated improved locomotion performance by emulating biological CPGs.

03

Validated benefits across multiple training methods and ablation tests.

Abstract

In recent years, Deep Reinforcement Learning has made impressive advances in solving several important benchmark problems for sequential decision making. Many control applications use a generic multilayer perceptron (MLP) for non-vision parts of the policy network. In this work, we propose a new neural network architecture for the policy network representation that is simple yet effective. The proposed Structured Control Net (SCN) splits the generic MLP into two separate sub-modules: a nonlinear control module and a linear control module. Intuitively, the nonlinear control is for forward-looking and global control, while the linear control stabilizes the local dynamics around the residual of global control. We hypothesize that this will bring together the benefits of both linear and nonlinear policies: improve training sample efficiency, final episodic reward, and generalization of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wongongv/scnwithdqn
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Zebrafish Biomedical Research Applications