Discretization of continuous input spaces in the hippocampal autoencoder
Adrian F. Amil, Ismael T. Freire, Paul F. M. J. Verschure

TL;DR
This paper demonstrates that sparse autoencoder neurons can discretize continuous sensory spaces into high-dimensional, minimal-overlap codes, supporting spatial and auditory cognition and enhancing reinforcement learning performance.
Contribution
It introduces a novel approach where hippocampal-like autoencoder neurons discretize sensory spaces, unifying spatial and episodic memory functions in a computational model.
Findings
Neurons produce spatial tuning similar to place cells.
High-dimensional codes enable minimal-overlap discretization.
Reinforcement learning agents improve performance using these representations.
Abstract
The hippocampus has been associated with both spatial cognition and episodic memory formation, but integrating these functions into a unified framework remains challenging. Here, we demonstrate that forming discrete memories of visual events in sparse autoencoder neurons can produce spatial tuning similar to hippocampal place cells. We then show that the resulting very high-dimensional code enables neurons to discretize and tile the underlying image space with minimal overlap. Additionally, we extend our results to the auditory domain, showing that neurons similarly tile the frequency space in an experience-dependent manner. Lastly, we show that reinforcement learning agents can effectively perform various visuo-spatial cognitive tasks using these sparse, very high-dimensional representations.
Peer Reviews
Decision·Submitted to ICLR 2025
1. Interesting use of the Animal-AI environment to study how phenomena commonly found in biological brains (place cells) might emerge. 2. Paper is clearly written (minus a few missing experimental details)
1. Confounding Visual and Spatial Representations: The study compares sparse autoencoder representations derived from static visual images to hippocampal place cells, effectively confounding visual encoding with spatial encoding. The hippocampus integrates both sensory cues (such as vision) along with self-motion cues (vestibular, proprioceptive, and motor information). The analogy to hippocampal place cells is therefore weak, as the model solely captures visual features, which may not directly
#### **Originality** This paper introduces an innovative approach by using sparse autoencoders with orthonormal regularization to simulate place cell-like behavior, bridging neuroscience and machine learning. This combination of techniques is a creative application in hippocampal modeling, especially as it applies across sensory modalities (visual and auditory) and introduces a new metric for discretization of input spaces. #### **Quality** The experimental design is thorough and well-documente
### **Inaccurate use of terms** The paper’s use of terms like "memory" and "locality-sensitive hashing (LSH)" misrepresent its functionality, suggesting capabilities that the model doesn’t fully achieve. The term "memory" implies persistence, flexible retrieval, and time-based storage. Episodic memories, in particular, include context, time, and location (what, when, where). The model, however, lacks mechanisms for retaining and recalling information across time, relying instead on transient re
* The authors have conducted a comprehensive set of experiments, including: * spatial navigation with visual inputs: I really appreciate the usage of Animal-AI environments for the experimentation, which distinguishes this paper with earlier ones like Benna and Fusi (2021) that only performed the experimentation on simulated tasks with low-dimensional inputs. * In-depth analysis of the latent structure of the autoencoder. * Investigation of frequency-sensitive “place cells”, which shows
* Main contribution is unclear: * Despite the comprehensive experiments presented in this work, I found the main contribution poorly defined, especially given that earlier works (e.g., Benna and Fusi 2021, Santos-Pata et al. 2021) have followed similar autoencoder-based approaches. * For example, the authors stated in the final sentence of abstract that the findings demonstrate how sparsity gives rise to interpretable memories and thus establishes a link between memory and place cells. How
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsSparse Autoencoder
