Benchmarking Neural Network Generalization for Grammar Induction

Nur Lan; Emmanuel Chemla; Roni Katzir

arXiv:2308.08253·cs.CL·August 28, 2023

Benchmarking Neural Network Generalization for Grammar Induction

Nur Lan, Emmanuel Chemla, Roni Katzir

PDF

Open Access 1 Repo

TL;DR

This paper introduces a formal language-based benchmark to evaluate neural network generalization in grammar induction, revealing that MDL-trained models outperform standard ones in data efficiency and generalization.

Contribution

It proposes a new formal language-based benchmark for neural network generalization and demonstrates that MDL training improves generalization over standard loss functions.

Findings

01

MDL-trained networks generalize better with less data

02

The benchmark includes languages like a^nb^n and Dyck-1, 2

03

Networks' generalization scores correlate with training data size

Abstract

How well do neural networks generalize? Even for grammar induction tasks, where the target generalization is fully known, previous works have left the question open, testing very limited ranges beyond the training set and using different success criteria. We provide a measure of neural network generalization based on fully specified formal languages. Given a model and a formal grammar, the method assigns a generalization score representing how well a model generalizes to unseen samples in inverse relation to the amount of data it was trained on. The benchmark includes languages such as $a^{n} b^{n}$ , $a^{n} b^{n} c^{n}$ , $a^{n} b^{m} c^{n + m}$ , and Dyck-1 and 2. We evaluate selected architectures using the benchmark and find that networks trained with a Minimum Description Length objective (MDL) generalize better and using less data than networks trained using standard loss functions. The benchmark is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

taucompling/bliss
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications