Sigmoid-Weighted Linear Units for Neural Network Function Approximation   in Reinforcement Learning

Stefan Elfwing; Eiji Uchibe; Kenji Doya

arXiv:1702.03118·cs.LG·November 3, 2017·75 cites

Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning

Stefan Elfwing, Eiji Uchibe, Kenji Doya

PDF

Open Access

TL;DR

This paper introduces SiLU and dSiLU activation functions for neural networks in reinforcement learning, demonstrating their effectiveness in surpassing DQN performance in Atari games and other benchmarks.

Contribution

It proposes novel activation functions for reinforcement learning neural networks and shows they can outperform DQN using simpler on-policy methods.

Findings

01

Achieved state-of-the-art results in Tetris variants with shallow networks.

02

Outperformed DQN in Atari 2600 games with deep Sarsa(λ) and SiLU/dSiLU units.

03

Validated the effectiveness of on-policy learning with these activations.

Abstract

In recent years, neural networks have enjoyed a renaissance as function approximators in reinforcement learning. Two decades after Tesauro's TD-Gammon achieved near top-level human performance in backgammon, the deep reinforcement learning algorithm DQN achieved human-level performance in many Atari 2600 games. The purpose of this study is twofold. First, we propose two activation functions for neural network function approximation in reinforcement learning: the sigmoid-weighted linear unit (SiLU) and its derivative function (dSiLU). The activation of the SiLU is computed by the sigmoid function multiplied by its input. Second, we suggest that the more traditional approach of using on-policy learning with eligibility traces, instead of experience replay, and softmax action selection with simple annealing can be competitive with DQN, without the need for a separate target network. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Evolutionary Algorithms and Applications

MethodsQ-Learning · Dense Connections · Convolution · Sigmoid Linear Unit · Deep Q-Network