Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning
Stefan Elfwing, Eiji Uchibe, Kenji Doya

TL;DR
This paper introduces SiLU and dSiLU activation functions for neural networks in reinforcement learning, demonstrating their effectiveness in surpassing DQN performance in Atari games and other benchmarks.
Contribution
It proposes novel activation functions for reinforcement learning neural networks and shows they can outperform DQN using simpler on-policy methods.
Findings
Achieved state-of-the-art results in Tetris variants with shallow networks.
Outperformed DQN in Atari 2600 games with deep Sarsa(λ) and SiLU/dSiLU units.
Validated the effectiveness of on-policy learning with these activations.
Abstract
In recent years, neural networks have enjoyed a renaissance as function approximators in reinforcement learning. Two decades after Tesauro's TD-Gammon achieved near top-level human performance in backgammon, the deep reinforcement learning algorithm DQN achieved human-level performance in many Atari 2600 games. The purpose of this study is twofold. First, we propose two activation functions for neural network function approximation in reinforcement learning: the sigmoid-weighted linear unit (SiLU) and its derivative function (dSiLU). The activation of the SiLU is computed by the sigmoid function multiplied by its input. Second, we suggest that the more traditional approach of using on-policy learning with eligibility traces, instead of experience replay, and softmax action selection with simple annealing can be competitive with DQN, without the need for a separate target network. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Evolutionary Algorithms and Applications
MethodsQ-Learning · Dense Connections · Convolution · Sigmoid Linear Unit · Deep Q-Network
