Transition-based Parsing with Lighter Feed-Forward Networks

David Vilares; Carlos G\'omez-Rodr\'iguez

arXiv:1810.08997·cs.CL·October 23, 2018·1 cites

Transition-based Parsing with Lighter Feed-Forward Networks

David Vilares, Carlos G\'omez-Rodr\'iguez

PDF

Open Access

TL;DR

This paper investigates how to create lighter, faster transition-based dependency parsers using feed-forward networks without sacrificing accuracy across various languages.

Contribution

It demonstrates that reducing feature complexity and embedding sizes can significantly speed up parsers while maintaining statistical equivalence.

Findings

01

Grand-daughter features can be removed without accuracy loss.

02

Embedding sizes can be substantially reduced.

03

Speed-ups are achieved with negligible impact on LAS.

Abstract

We explore whether it is possible to build lighter parsers, that are statistically equivalent to their corresponding standard version, for a wide set of languages showing different structures and morphologies. As testbed, we use the Universal Dependencies and transition-based dependency parsers trained on feed-forward networks. For these, most existing research assumes de facto standard embedded features and relies on pre-computation tricks to obtain speed-ups. We explore how these features and their size can be reduced and whether this translates into speed-ups with a negligible impact on accuracy. The experiments show that grand-daughter features can be removed for the majority of treebanks without a significant (negative or positive) LAS difference. They also show how the size of the embeddings can be notably reduced.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Machine Learning in Bioinformatics