Recurrent babbling: evaluating the acquisition of grammar from limited   input data

Ludovica Pannitto; Aur\'elie Herbelot

arXiv:2010.04637·cs.CL·November 12, 2024

Recurrent babbling: evaluating the acquisition of grammar from limited input data

Ludovica Pannitto, Aur\'elie Herbelot

PDF

TL;DR

This paper investigates how Long Short-Term Memory networks (LSTMs) can learn grammatical structures from realistic, limited child-directed input, demonstrating that they develop abstraction over time similar to language acquisition.

Contribution

It introduces a novel methodology to quantify grammatical abstraction in neural network output and shows LSTMs can learn and generalize grammar from limited, realistic data.

Findings

01

LSTMs abstract grammatical structures over time

02

The methodology quantifies grammatical development in neural models

03

LSTMs can learn from realistic, limited input data

Abstract

Recurrent Neural Networks (RNNs) have been shown to capture various aspects of syntax from raw linguistic input. In most previous experiments, however, learning happens over unrealistic corpora, which do not reflect the type and amount of data a child would be exposed to. This paper remedies this state of affairs by training a Long Short-Term Memory network (LSTM) over a realistically sized subset of child-directed input. The behaviour of the network is analysed over time using a novel methodology which consists in quantifying the level of grammatical abstraction in the model's generated output (its "babbling"), compared to the language it has been exposed to. We show that the LSTM indeed abstracts new structuresas learning proceeds.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTanh Activation · Sigmoid Activation · Memory Network · Long Short-Term Memory