Extending Machine Language Models toward Human-Level Language Understanding
James L. McClelland, Felix Hill, Maja Rudolph, Jason Baldridge and, Hinrich Sch\"utze

TL;DR
This paper discusses extending language models to better emulate human-level understanding by incorporating cognitive neuroscience principles, memory, and context-aware processing.
Contribution
It proposes a framework combining neuroscience insights and AI techniques, especially query-based attention, to advance language models toward human-like understanding.
Findings
Current models excel at internal language tasks
Models lack memory of past situations outside fixed context
Future directions include integrating brain-inspired memory systems
Abstract
Language is crucial for human intelligence, but what exactly is its role? We take language to be a part of a system for understanding and communicating about situations. The human ability to understand and communicate about situations emerges gradually from experience and depends on domain-general principles of biological neural networks: connection-based learning, distributed representation, and context-sensitive, mutual constraint satisfaction-based processing. Current artificial language processing systems rely on the same domain general principles, embodied in artificial neural networks. Indeed, recent progress in this field depends on \emph{query-based attention}, which extends the ability of these systems to exploit context and has contributed to remarkable breakthroughs. Nevertheless, most current models focus exclusively on language-internal tasks, limiting their ability to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
