Forward Only Learning for Orthogonal Neural Networks of any Depth

Paul Caillon; Alex Colagrande; Erwan Fagnou; Blaise Delattre; Alexandre Allauzen

arXiv:2512.20668·cs.LG·December 25, 2025

Open Access

TL;DR

This paper introduces FOTON, a forward-only training algorithm for orthogonal neural networks that scales to any depth and outperforms previous methods like PEPITA, reducing computational costs without requiring backpropagation.

Contribution

The paper presents FOTON, a novel forward-only training method for orthogonal neural networks that overcomes scalability issues of prior approaches and bridges the gap with backpropagation.

Findings

01. FOTON trains deeper networks than PEPITA and reaches higher accuracy.

02. FOTON enables training of neural networks of any depth without backward passes.

03. Results on convolutional networks suggest the approach applies beyond fully connected architectures.

Abstract

Backpropagation is still the de facto algorithm used to train neural networks. With the exponential growth of recent architectures, its computational cost has become a burden. The recent PEPITA and forward-only frameworks proposed promising alternatives, but they fail to scale beyond a handful of hidden layers, which limits their use. In this paper, we first analyze theoretically the main limitations of these approaches. This analysis allows us to design a forward-only algorithm that is equivalent to backpropagation under the linear and orthogonal assumptions. By relaxing the linear assumption, we then introduce FOTON (Forward-Only Training of Orthogonal Networks), which bridges the gap with the backpropagation algorithm. Experimental results show that it outperforms PEPITA, enabling us to train neural networks of any depth, without…
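The abstract's claim that a forward-only scheme can match backpropagation under the linear and orthogonal assumptions can be illustrated with a small numerical check. The sketch below is not the paper's algorithm. It assumes square orthogonal weight matrices, no nonlinearity, and a feedback map equal to the transpose of the end-to-end linear map (computed directly from the weights here purely for illustration). All function names are hypothetical. Under those assumptions, the difference between a clean forward pass and a second forward pass on an error-modulated input reproduces the backpropagated error signal at every layer.

```python
import random

def matmul(a, b):
    # Plain nested-list matrix product.
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def matvec(a, x):
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in a]

def householder(n, rng):
    # H = I - 2 v v^T / (v^T v) is an orthogonal reflection.
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]
    norm2 = sum(vi * vi for vi in v)
    return [[(1.0 if i == j else 0.0) - 2.0 * v[i] * v[j] / norm2
             for j in range(n)] for i in range(n)]

def random_orthogonal(n, rng):
    # A product of orthogonal matrices is orthogonal.
    return matmul(householder(n, rng), householder(n, rng))

def forward(weights, x):
    # Layer outputs h_1..h_L of a linear network h_l = W_l h_{l-1}.
    acts, h = [], x
    for w in weights:
        h = matvec(w, h)
        acts.append(h)
    return acts

def backprop_deltas(weights, e):
    # delta_L = e, then delta_l = W_{l+1}^T delta_{l+1}.
    deltas = [e]
    for w in reversed(weights[1:]):
        deltas.append(matvec(transpose(w), deltas[-1]))
    return deltas[::-1]

def forward_only_deltas(weights, x, e):
    # ASSUMPTION: feedback F = (W_L ... W_1)^T, built from the weights.
    p = e
    for w in reversed(weights):
        p = matvec(transpose(w), p)
    x_mod = [xi + pi for xi, pi in zip(x, p)]
    clean = forward(weights, x)
    modulated = forward(weights, x_mod)
    # Activation differences between the two forward passes.
    return [[m - c for m, c in zip(hm, hc)]
            for hm, hc in zip(modulated, clean)]

rng = random.Random(0)
n, depth = 4, 6
weights = [random_orthogonal(n, rng) for _ in range(depth)]
x = [rng.gauss(0.0, 1.0) for _ in range(n)]
e = [rng.gauss(0.0, 1.0) for _ in range(n)]  # stand-in output error

bp = backprop_deltas(weights, e)
fo = forward_only_deltas(weights, x, e)
max_gap = max(abs(a - b) for da, db in zip(bp, fo) for a, b in zip(da, db))
print("largest per-layer gap:", max_gap)
```

The match follows from W W^T = I: pushing the projected error forward through the first l layers cancels their transposes and leaves exactly the product of transposes that backpropagation applies from the output down to layer l. The printed gap should be at the level of floating-point round-off. With non-orthogonal weights or a nonlinearity the cancellation no longer holds exactly, which is the regime the abstract says FOTON addresses by relaxing the linear assumption.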

Taxonomy

Topics: Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Machine Learning and ELM