Sequence-to-Sequence Models with Attention Mechanistically Map to the Architecture of Human Memory Search
Nikolaus Salvatore, Qiong Zhang

TL;DR
This paper shows that neural sequence-to-sequence models with attention mechanisms mirror human memory search processes, offering interpretable models that align with cognitive theories and replicate human behavioral patterns.
Contribution
It reveals the mechanistic correspondence between neural translation models and human memory architectures, providing a novel, interpretable cognitive model of memory search.
Findings
Model accounts for human behavioral patterns in memory search
Neural models exhibit mechanisms similar to human memory processes
Model captures complex learning dynamics in memory retrieval
Abstract
Past work has long recognized the important role of context in guiding how humans search their memory. While context-based memory models can explain many memory phenomena, it remains unclear why humans develop such architectures over possible alternatives in the first place. In this work, we demonstrate that foundational architectures in neural machine translation -- specifically, recurrent neural network (RNN)-based sequence-to-sequence models with attention -- exhibit mechanisms that directly correspond to those specified in the Context Maintenance and Retrieval (CMR) model of human memory. Since neural machine translation models have evolved to optimize task performance, their convergence with human memory models provides a deeper understanding of the functional role of context in human memory, as well as presenting new ways to model human memory. Leveraging this convergence, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
