Multikernel activation functions: formulation and a case study
Simone Scardapane, Elena Nieddu, Donatella Firmani, Paolo Merialdo

TL;DR
This paper introduces multikernel activation functions (multi-KAFs) for neural networks, combining multiple kernels to improve learning flexibility and accuracy, demonstrated through a Latin OCR case study with faster convergence and fewer parameters.
Contribution
The paper proposes a novel multi-KAF model that extends kernel activation functions by linearly combining multiple kernels, enhancing performance and convergence in neural networks.
Findings
Multi-KAFs improve OCR accuracy on Latin script.
Multi-KAFs lead to faster convergence in training.
Multi-KAFs achieve better results with fewer parameters.
Abstract
The design of activation functions is a growing research area in the field of neural networks. In particular, instead of using fixed point-wise functions (e.g., the rectified linear unit), several authors have proposed ways of learning these functions directly from the data in a non-parametric fashion. In this paper we focus on the kernel activation function (KAF), a recently proposed framework wherein each function is modeled as a one-dimensional kernel model, whose weights are adapted through standard backpropagation-based optimization. One drawback of KAFs is the need to select a single kernel function and its eventual hyper-parameters. To partially overcome this problem, we motivate an extension of the KAF model, in which multiple kernels are linearly combined at every neuron, inspired by the literature on multiple kernel learning. We provide an application of the resulting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Neural Network Applications · Machine Learning and Data Classification
MethodsKernel Activation Function
