Consensus Function from an $L_p^q-$norm Regularization Term for its Use as Adaptive Activation Functions in Neural Networks
Juan Heredia-Juesas, Jos\'e \'A. Mart\'inez-Lorenzo

TL;DR
This paper introduces a novel adaptive activation function derived from an $L_p^q$ regularization term, enabling neural networks to dynamically modify their activation shapes during training, which improves performance in regression and classification tasks.
Contribution
The paper proposes a new implicit, parametric activation function based on consensus variables from $L_p^q$ regularization, allowing adaptive behavior during training and simplifying network architecture design.
Findings
Reduces error in regression tasks
Improves classification accuracy
Enhances network flexibility
Abstract
The design of a neural network is usually carried out by defining the number of layers, the number of neurons per layer, their connections or synapses, and the activation function that they will execute. The training process tries to optimize the weights assigned to those connections, together with the biases of the neurons, to better fit the training data. However, the definition of the activation functions is, in general, determined in the design process and not modified during the training, meaning that their behavior is unrelated to the training data set. In this paper we propose the definition and utilization of an implicit, parametric, non-linear activation function that adapts its shape during the training process. This fact increases the space of parameters to optimize within the network, but it allows a greater flexibility and generalizes the concept of neural networks.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and ELM · Non-Destructive Testing Techniques
