Beyond Weighted Summation: Learnable Nonlinear Aggregation Functions for Robust Artificial Neurons
Berke Deniz Bozyigit

TL;DR
This paper explores learnable nonlinear aggregation functions in artificial neurons to enhance robustness against noisy inputs, demonstrating improved performance on CIFAR-10 with hybrid neurons that adaptively combine linear and nonlinear aggregation.
Contribution
Introduces differentiable nonlinear aggregation mechanisms and hybrid neurons that interpolate between linear and nonlinear aggregation for improved robustness.
Findings
Hybrid neurons significantly improve noise robustness.
Learned aggregation parameters favor sub-linear and high novelty utilization.
Modest gains observed on clean data with F-Mean hybrids.
Abstract
Weighted summation has remained the default input aggregation mechanism in artificial neurons since the earliest neural network models. While computationally efficient, this design implicitly behaves like a mean-based estimator and is therefore sensitive to noisy or extreme inputs. This paper investigates whether replacing fixed linear aggregation with learnable nonlinear alternatives can improve neural network robustness without sacrificing trainability. Two differentiable aggregation mechanisms are introduced: an F-Mean neuron based on a learnable power-weighted aggregation rule, and a Gaussian Support neuron based on distance-aware affinity weighting. To preserve the optimisation stability of standard neurons, hybrid neurons are proposed that interpolate between linear and nonlinear aggregation through a learnable blending parameter. Evaluated in multilayer perceptrons and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
