On the Improvement of Generalization and Stability of Forward-Only Learning via Neural Polarization
Erik B. Terres-Escudero, Javier Del Ser, Pablo Garcia-Bringas

TL;DR
This paper introduces Polar-FFA, an improved version of the Forward-Forward Algorithm that enhances generalization and stability by balancing gradients through neural polarization, leading to better accuracy and faster convergence.
Contribution
The paper proposes Polar-FFA, a novel extension of FFA that employs neural polarization to address gradient imbalance, improving training stability and performance.
Findings
Polar-FFA outperforms FFA in accuracy and convergence speed.
Polar-FFA requires less hyperparameter tuning.
Polar-FFA demonstrates improved generalization across datasets.
Abstract
Forward-only learning algorithms have recently gained attention as alternatives to gradient backpropagation, replacing the backward step of this latter solver with an additional contrastive forward pass. Among these approaches, the so-called Forward-Forward Algorithm (FFA) has been shown to achieve competitive levels of performance in terms of generalization and complexity. Networks trained using FFA learn to contrastively maximize a layer-wise defined goodness score when presented with real data (denoted as positive samples) and to minimize it when processing synthetic data (corr. negative samples). However, this algorithm still faces weaknesses that negatively affect the model accuracy and training stability, primarily due to a gradient imbalance between positive and negative samples. To overcome this issue, in this work we propose a novel implementation of the FFA algorithm, denoted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Target Tracking and Data Fusion in Sensor Networks · Machine Learning and ELM
MethodsSoftmax · Attention Is All You Need
