Contrastive Forward-Forward: A Training Algorithm of Vision Transformer
Hossein Aghagolzadeh, Mehdi Ezoji

TL;DR
This paper introduces Contrastive Forward-Forward, an improved training algorithm for Vision Transformers inspired by brain-like processes, achieving higher accuracy and faster convergence than the original Forward-Forward method and narrowing the gap with backpropagation.
Contribution
The paper extends the Forward-Forward algorithm to Vision Transformers and incorporates contrastive learning insights, significantly enhancing performance and convergence speed.
Findings
Up to 10% accuracy improvement over baseline Forward-Forward
Convergence accelerated by 5 to 20 times
Outperforms backpropagation under certain conditions
Abstract
Although backpropagation is widely accepted as a training algorithm for artificial neural networks, researchers are always looking for inspiration from the brain to find ways with potentially better performance. Forward-Forward is a novel training algorithm that is more similar to what occurs in the brain, although there is a significant performance gap compared to backpropagation. In the Forward-Forward algorithm, the loss functions are placed after each layer, and the updating of a layer is done using two local forward passes and one local backward pass. Forward-Forward is in its early stages and has been designed and evaluated on simple multi-layer perceptron networks to solve image classification tasks. In this work, we have extended the use of this algorithm to a more complex and modern network, namely the Vision Transformer. Inspired by insights from contrastive learning, we have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Neural Networks and Applications · Infrared Target Detection Methodologies
MethodsAttention Is All You Need · Label Smoothing · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Adam · Softmax · Dropout · Absolute Position Encodings · Transformer · Linear Layer
