Extending Machine Language Models toward Human-Level Language Understanding

James L. McClelland; Felix Hill; Maja Rudolph; Jason Baldridge; Hinrich Schütze

arXiv:1912.05877·cs.CL·July 7, 2020

Open Access

TL;DR

This paper discusses extending language models to better emulate human-level understanding by incorporating cognitive neuroscience principles, memory, and context-aware processing.

Contribution

It proposes a framework combining neuroscience insights and AI techniques, especially query-based attention, to advance language models toward human-like understanding.

Findings

01

Current models excel at language-internal tasks

02

Models lack memory of past situations outside a fixed context

03

Future directions include integrating brain-inspired memory systems

Abstract

Language is crucial for human intelligence, but what exactly is its role? We take language to be a part of a system for understanding and communicating about situations. The human ability to understand and communicate about situations emerges gradually from experience and depends on domain-general principles of biological neural networks: connection-based learning, distributed representation, and context-sensitive, mutual constraint satisfaction-based processing. Current artificial language processing systems rely on the same domain-general principles, embodied in artificial neural networks. Indeed, recent progress in this field depends on query-based attention, which extends the ability of these systems to exploit context and has contributed to remarkable breakthroughs. Nevertheless, most current models focus exclusively on language-internal tasks, limiting their ability to…

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications