Hadamard Representations: Augmenting Hyperbolic Tangents in RL

Jacob E. Kooi; Mark Hoogendoorn; Vincent Fran\c{c}ois-Lavet

arXiv:2406.09079·cs.LG·May 20, 2025

Hadamard Representations: Augmenting Hyperbolic Tangents in RL

Jacob E. Kooi, Mark Hoogendoorn, Vincent Fran\c{c}ois-Lavet

PDF

Open Access

TL;DR

This paper introduces Hadamard representations to enhance hyperbolic tangent activations in reinforcement learning, addressing dying neuron issues and improving learning efficiency and representation capacity.

Contribution

It proposes a novel Hadamard representation method that mitigates dying neurons in continuous activations like tanh, boosting RL performance.

Findings

01

Faster learning in Atari benchmarks

02

Reduced dead neurons in neural networks

03

Increased effective rank of activations

Abstract

Activation functions are one of the key components of a deep neural network. The most commonly used activation functions can be classed into the category of continuously differentiable (e.g. tanh) and piece-wise linear functions (e.g. ReLU), both having their own strengths and drawbacks with respect to downstream performance and representation capacity through learning. In reinforcement learning, the performance of continuously differentiable activations often falls short as compared to piece-wise linear functions. We show that the dying neuron problem in RL is not exclusive to ReLUs and actually leads to additional problems in the case of continuously differentiable activations such as tanh. To alleviate the dying neuron problem with these activations, we propose a Hadamard representation that unlocks the advantages of continuously differentiable activations. Using DQN, PPO and PQN in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Music and Audio Processing · Human Motion and Animation

MethodsConvolution · Q-Learning · Dense Connections · Deep Q-Network · Entropy Regularization · Proximal Policy Optimization