Sequence-to-sequence models with attention mechanistically map to the architecture of human memory search
Nikolaus Salvatore, Qiong Zhang

TL;DR
This study shows that machine learning models with attention mechanisms work similarly to how humans search their memory, offering new ways to understand and model memory.
Contribution
The study reveals that sequence-to-sequence models with attention mechanistically align with human memory search architectures.
Findings
Sequence-to-sequence models with attention mirror the CMR model of human memory.
The model captures both average and optimal human memory behaviors using a free recall dataset.
The model's performance emerges from interactions between its components.
Abstract
Past work has long recognized the important role of context in guiding how humans search their memory. While context-based memory models can explain many memory phenomena, it remains unclear why humans develop such architectures over possible alternatives in the first place. In this work, we demonstrate that foundational architectures in neural machine translation – specifically, recurrent neural network (RNN)-based sequence-to-sequence models with attention – exhibit mechanisms that directly correspond to those specified in the Context Maintenance and Retrieval (CMR) model of human memory. Since neural machine translation models have evolved to optimize task performance, their convergence with human memory models provides a deeper understanding of the functional role of context in human memory, as well as presenting alternative ways to model human memory. Leveraging this convergence,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMemory Processes and Influences · Topic Modeling · Domain Adaptation and Few-Shot Learning
