Improving Quaternion Neural Networks with Quaternionic Activation Functions
Johannes P\"oppelbaum, Andreas Schwung

TL;DR
This paper introduces novel quaternion activation functions that modify either magnitude or phase, leveraging quaternion properties to improve neural network performance, especially in image classification tasks, by enhancing gradient flow and utilizing quaternion algebra.
Contribution
The paper proposes new quaternion activation functions based on quaternion properties, offering an alternative to split activations and demonstrating improved performance in image classification.
Findings
Quaternion activation functions improve performance on CIFAR-10 and SVHN datasets.
Phase-modifying quaternion activations provide consistent performance gains.
Proposed functions enhance gradient flow in quaternion neural networks.
Abstract
In this paper, we propose novel quaternion activation functions where we modify either the quaternion magnitude or the phase, as an alternative to the commonly used split activation functions. We define criteria that are relevant for quaternion activation functions, and subsequently we propose our novel activation functions based on this analysis. Instead of applying a known activation function like the ReLU or Tanh on the quaternion elements separately, these activation functions consider the quaternion properties and respect the quaternion space . In particular, all quaternion components are utilized to calculate all output components, carrying out the benefit of the Hamilton product in e.g. the quaternion convolution to the activation functions. The proposed activation functions can be incorporated in arbitrary quaternion valued neural networks trained with gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsConvolution
