Covariance-Aware Goodness for Scalable Forward-Forward Learning
Xiaoyi Jiang, Bashir M. Al-Hashimi, Kai Xu

TL;DR
This paper introduces covariance-aware goodness functions and architectural enhancements to improve the scalability and performance of Forward-Forward learning on complex datasets like ImageNet-100 and Tiny-ImageNet.
Contribution
It proposes a novel covariance-aware goodness framework with structured second-order information, along with modules for feature alignment and aggregation, enabling deeper BP-free networks.
Findings
Achieves 73.01% on ImageNet-100
Attains 50.30% on Tiny-ImageNet
Extends effective Forward-Forward training to 16-layer architectures
Abstract
The Forward-Forward algorithm eliminates global gradient flow and full network activations storage. However, in convolutional settings, existing BP-free FF methods significantly under-perform backpropagation on complex benchmarks such as ImageNet-100 and Tiny-ImageNet. We identify this gap as a structural bottleneck in goodness extraction: standard sum-of-squares formulation collapses feature volumes into channel-wise activation energies which omits critical second-order dependencies. To address this, we propose a framework centered on three key components. First, Bi-axis Covariance Goodness(BiCovG) explicitly augments the standard goodness function with structured second-order information along two axes: cross-channel projections that model inter-feature covariance, and nested multi-scale aggregation that encodes spatial correlation statistics. This provides a tractable approximation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
