A Theory of Language Learning

Robert Worden

arXiv:2106.14612·cs.CL·June 29, 2021

A Theory of Language Learning

Robert Worden

PDF

Open Access

TL;DR

This paper presents a Bayesian-based theory of language learning that models how children acquire language, emphasizing robustness, speed, and the ability to learn complex structures from minimal examples, aligning well with key language acquisition facts.

Contribution

It introduces a novel Bayesian framework using feature structures and scripts for language learning, capable of bootstrapping language from zero vocabulary with minimal data.

Findings

01

Learns complex meanings and syntax from about six examples

02

Robust to noise and irrelevant data

03

Aligns with key facts of language acquisition across multiple linguistic domains

Abstract

A theory of language learning is described, which uses Bayesian induction of feature structures (scripts) and script functions. Each word sense in a language is mentally represented by an m-script, a script function which embodies all the syntax and semantics of the word. M-scripts form a fully-lexicalised unification grammar, which can support adult language. Each word m-script can be learnt robustly from about six learning examples. The theory has been implemented as a computer model, which can bootstrap-learn a language from zero vocabulary. The Bayesian learning mechanism is (1) Capable: to learn arbitrarily complex meanings and syntactic structures; (2) Fast: learning these structures from a few examples each; (3) Robust: learning in the presence of much irrelevant noise, and (4) Self-repairing: able to acquire implicit negative evidence, using it to learn exceptions. Children…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Syntax, Semantics, Linguistic Variation · Language Development and Disorders