Convolutional Networks in Visual Environments
Alessandro Betti, Marco Gori

TL;DR
This paper introduces a new theory of learning with convolutional networks based on motion invariance, enabling unsupervised feature learning from video signals through differential equations and cognitive action minimization.
Contribution
It proposes a novel unsupervised learning framework for convolutional networks driven by motion invariance and differential equations, differing from traditional supervised methods.
Findings
Enables unsupervised learning of convolutional filters from video.
Uses differential equations derived from the principle of least cognitive action.
Provides a computational scheme inspired by biological vision processes.
Abstract
The puzzle of computer vision might find new challenging solutions when we realize that most successful methods are working at image level, which is remarkably more difficult than processing directly visual streams. In this paper, we claim that their processing naturally leads to formulate the motion invariance principle, which enables the construction of a new theory of learning with convolutional networks. The theory addresses a number of intriguing questions that arise in natural vision, and offers a well-posed computational scheme for the discovery of convolutional filters over the retina. They are driven by differential equations derived from the principle of least cognitive action. Unlike traditional convolutional networks, which need massive supervision, the proposed theory offers a truly new scenario in which feature learning takes place by unsupervised processing of video…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques
