Estimation of Stochastic Attribute-Value Grammars using an Informative   Sample

Miles Osborne

arXiv:cs/0008024·cs.CL·May 23, 2007

Estimation of Stochastic Attribute-Value Grammars using an Informative Sample

Miles Osborne

PDF

Open Access

TL;DR

This paper proposes using an informative subset of training data to efficiently estimate stochastic attribute-value grammars, showing that it can outperform full data training in some cases and that Gaussian Priors help reduce overfitting in unlexicalised models.

Contribution

It introduces a method for training stochastic attribute-value grammars on informative samples, reducing computational complexity and improving estimation accuracy in certain scenarios.

Findings

01

Informative samples can outperform full data training in some cases.

02

Gaussian Priors help reduce overfitting in unlexicalised models.

03

Lexicalised models with overlapping features are less affected by overfitting.

Abstract

We argue that some of the computational complexity associated with estimation of stochastic attribute-value grammars can be reduced by training upon an informative subset of the full training set. Results using the parsed Wall Street Journal corpus show that in some circumstances, it is possible to obtain better estimation results using an informative sample than when training upon all the available material. Further experimentation demonstrates that with unlexicalised models, a Gaussian Prior can reduce overfitting. However, when models are lexicalised and contain overlapping features, overfitting does not seem to be a problem, and a Gaussian Prior makes minimal difference to performance. Our approach is applicable for situations when there are an infeasibly large number of parses in the training set, or else for when recovery of these parses from a packed representation is itself…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques