Grid Cell Path Integration For Movement-Based Visual Object Recognition
Niels Leadholm (1, 2), Marcus Lewis (1), Subutai Ahmad (1) ((1), Numenta, (2) The University of Oxford)

TL;DR
This paper introduces GridCellNet, a neural network model inspired by grid cells that supports reliable object recognition from arbitrary sequences of visual inputs, enabling few-shot learning and partial reconstruction.
Contribution
The work presents a novel grid cell-based network for vision that handles arbitrary input sequences, enabling efficient recognition and feature prediction with few examples.
Findings
GridCellNet outperforms k-NN and RNNs in sequence-based object recognition.
The model generalizes to unseen examples and novel trajectories.
It can reconstruct images from partial input samples.
Abstract
Grid cells enable the brain to model the physical space of the world and navigate effectively via path integration, updating self-position using information from self-movement. Recent proposals suggest that the brain might use similar mechanisms to understand the structure of objects in diverse sensory modalities, including vision. In machine vision, object recognition given a sequence of sensory samples of an image, such as saccades, is a challenging problem when the sequence does not follow a consistent, fixed pattern - yet this is something humans do naturally and effortlessly. We explore how grid cell-based path integration in a cortical network can support reliable recognition of objects given an arbitrary sequence of inputs. Our network (GridCellNet) uses grid cell computations to integrate visual information and make predictions based on movements. We use local Hebbian plasticity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Neural Networks and Applications · CCD and CMOS Imaging Sensors
