Recurrent Joint Embedding Predictive Architecture with Recurrent Forward Propagation Learning
Osvaldo M Velarde, Lucas C Parra

TL;DR
This paper proposes a biologically inspired recurrent vision network that predicts future image patches using a novel learning algorithm, Recurrent-Forward Propagation, which avoids traditional backpropagation complexities.
Contribution
It introduces a new recurrent joint embedding predictive architecture with a biologically plausible learning algorithm that ensures efficient training without representational collapse.
Findings
Mathematically proven that the algorithm implements exact gradient descent.
Empirical evidence shows the network learns efficiently.
Avoids backpropagation through time, making it more biologically plausible.
Abstract
Conventional computer vision models rely on very deep, feedforward networks processing whole images and trained offline with extensive labeled data. In contrast, biological vision relies on comparatively shallow, recurrent networks that analyze sequences of fixated image patches, learning continuously in real-time without explicit supervision. This work introduces a vision network inspired by these biological principles. Specifically, it leverages a joint embedding predictive architecture incorporating recurrent gated circuits. The network learns by predicting the representation of the next image patch (fixation) based on the sequence of past fixations, a form of self-supervised learning. We show mathematical and empirically that the training algorithm avoids the problem of representational collapse. We also introduce \emph{Recurrent-Forward Propagation}, a learning algorithm that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Data Compression Techniques · Text and Document Classification Technologies
