
TL;DR
This paper presents a Bayesian-based theory of language learning that models how children acquire language, emphasizing robustness, speed, and the ability to learn complex structures from minimal examples, aligning well with key language acquisition facts.
Contribution
It introduces a novel Bayesian framework using feature structures and scripts for language learning, capable of bootstrapping language from zero vocabulary with minimal data.
Findings
Learns complex meanings and syntax from about six examples
Robust to noise and irrelevant data
Aligns with key facts of language acquisition across multiple linguistic domains
Abstract
A theory of language learning is described, which uses Bayesian induction of feature structures (scripts) and script functions. Each word sense in a language is mentally represented by an m-script, a script function which embodies all the syntax and semantics of the word. M-scripts form a fully-lexicalised unification grammar, which can support adult language. Each word m-script can be learnt robustly from about six learning examples. The theory has been implemented as a computer model, which can bootstrap-learn a language from zero vocabulary. The Bayesian learning mechanism is (1) Capable: to learn arbitrarily complex meanings and syntactic structures; (2) Fast: learning these structures from a few examples each; (3) Robust: learning in the presence of much irrelevant noise, and (4) Self-repairing: able to acquire implicit negative evidence, using it to learn exceptions. Children…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Syntax, Semantics, Linguistic Variation · Language Development and Disorders
