Structural Optimization Ambiguity and Simplicity Bias in Unsupervised Neural Grammar Induction

Jinwook Park; Kangil Kim

arXiv:2407.16181 · cs.CL · July 24, 2024

TL;DR

This paper identifies two problems in unsupervised neural grammar induction, structural optimization ambiguity and structural simplicity bias, analyzes their origins, and proposes sentence-wise parse-focusing to improve accuracy and interpretability.

Contribution

It introduces a novel sentence-wise parse focusing approach that leverages pre-trained parsers to reduce ambiguity and bias in unsupervised neural grammar induction.

Findings

01. Significant performance improvements on unsupervised parsing benchmarks

02. Reduction in prediction variance and bias towards simple parses

03. Enhanced interpretability of learned grammars

Abstract

Neural parameterization has significantly advanced unsupervised grammar induction. However, training these models with a traditional likelihood loss for all possible parses exacerbates two issues: 1) structural optimization ambiguity, which arbitrarily selects one among structurally ambiguous optimal grammars despite the specific preference of gold parses, and 2) structural simplicity bias, which leads a model to underutilize rules to compose parse trees. These challenges subject unsupervised neural grammar induction (UNGI) to inevitable prediction errors, high variance, and the necessity for extensive grammars to achieve accurate predictions. This paper tackles these issues, offering a comprehensive analysis of their origins. As a solution, we introduce sentence-wise parse-focusing to reduce the parse pool per sentence for loss evaluation, using the…
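To make the contrast between a likelihood over all possible parses and a loss over a reduced parse pool concrete, here is a minimal toy sketch. It is not the authors' implementation: the brute-force tree enumeration, the `span_log_weights` table standing in for a grammar's joint log-probabilities, and the choice of two fixed trees as the pool (imagined outputs of hypothetical pre-trained parsers) are all assumptions made for illustration.

```python
import math


def binary_trees(i, j):
    """Yield every binary bracketing of span [i, j) as a frozenset of spans."""
    if j - i == 1:
        # A single word has no internal bracket.
        yield frozenset()
        return
    for k in range(i + 1, j):
        for left in binary_trees(i, k):
            for right in binary_trees(k, j):
                yield left | right | {(i, j)}


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) for a non-empty list."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def tree_log_weight(tree, span_log_weights):
    """Toy stand-in for log p(sentence, tree): sum of per-span log-weights."""
    return sum(span_log_weights.get(span, 0.0) for span in tree)


def loss_over_pool(pool, span_log_weights):
    """Negative log of the total weight of the trees in `pool`."""
    return -log_sum_exp([tree_log_weight(t, span_log_weights) for t in pool])


n = 4  # toy sentence length
all_trees = list(binary_trees(0, n))

# Hypothetical pool: a left-branching and a right-branching tree, standing in
# for parses that pre-trained parsers might return for this sentence.
left_branching = frozenset({(0, 2), (0, 3), (0, 4)})
right_branching = frozenset({(2, 4), (1, 4), (0, 4)})
focused_pool = [left_branching, right_branching]

# Hypothetical log-weights; unlisted spans default to 0.0.
span_log_weights = {(0, 2): 0.5, (2, 4): 0.2, (1, 3): -0.3}

full_loss = loss_over_pool(all_trees, span_log_weights)
focused_loss = loss_over_pool(focused_pool, span_log_weights)
```

In this sketch the focused loss only sees the trees in the pool, so parameter updates derived from it cannot be driven by parses outside the pool. A real model would compute the full sum with the inside algorithm rather than by enumeration.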

Code & Models

Repositories

GIST-IRR/Parse-Focusing (PyTorch, official)