Structured Control Nets for Deep Reinforcement Learning
Mario Srouji, Jian Zhang, Ruslan Salakhutdinov

TL;DR
This paper introduces Structured Control Nets, a neural network architecture for deep reinforcement learning that combines linear and nonlinear modules to improve training efficiency, reward, and generalization across various control tasks.
Contribution
The paper proposes a novel neural network architecture, Structured Control Net, that splits the policy network into linear and nonlinear modules to enhance performance and efficiency in reinforcement learning.
Findings
Achieved competitive results on MuJoCo, Roboschool, Atari, and urban driving environments.
Demonstrated improved locomotion performance by emulating biological CPGs.
Validated benefits across multiple training methods and ablation tests.
Abstract
In recent years, Deep Reinforcement Learning has made impressive advances in solving several important benchmark problems for sequential decision making. Many control applications use a generic multilayer perceptron (MLP) for non-vision parts of the policy network. In this work, we propose a new neural network architecture for the policy network representation that is simple yet effective. The proposed Structured Control Net (SCN) splits the generic MLP into two separate sub-modules: a nonlinear control module and a linear control module. Intuitively, the nonlinear control is for forward-looking and global control, while the linear control stabilizes the local dynamics around the residual of global control. We hypothesize that this will bring together the benefits of both linear and nonlinear policies: improve training sample efficiency, final episodic reward, and generalization of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Zebrafish Biomedical Research Applications
