Optimizing Spectral Learning for Parsing

Shashi Narayan; Shay B. Cohen

arXiv:1606.02342 · cs.CL · June 15, 2016

TL;DR

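This paper introduces a search algorithm that globally optimizes the number of latent states per nonterminal when estimating latent-variable PCFGs (L-PCFGs) with spectral methods. Taking interactions between nonterminals into account improves parsing results, and the approach is evaluated on eight morphologically rich languages.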

Contribution

It presents a search algorithm that optimizes the number of latent states for all nonterminals jointly in spectral estimation, challenging the belief that the number of latent states for each nonterminal can be decided in isolation.

Findings

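01 Globally optimizing the number of latent states per nonterminal significantly improves parsing results

02 Spectral estimation performs better than or close to coarse-to-fine expectation-maximization (EM) techniques

03 Evaluated on eight morphologically rich languages: Basque, French, German, Hebrew, Hungarian, Korean, Polish and Swedish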

Abstract

We describe a search algorithm for optimizing the number of latent states when estimating latent-variable PCFGs with spectral methods. Our results show that contrary to the common belief that the number of latent states for each nonterminal in an L-PCFG can be decided in isolation with spectral methods, parsing results significantly improve if the number of latent states for each nonterminal is globally optimized, while taking into account interactions between the different nonterminals. In addition, we contribute an empirical analysis of spectral algorithms on eight morphologically rich languages: Basque, French, German, Hebrew, Hungarian, Korean, Polish and Swedish. Our results show that our estimation consistently performs better or close to coarse-to-fine expectation-maximization techniques for these languages.
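
The contrast between deciding latent-state counts in isolation and optimizing them globally can be illustrated with a toy sketch. This is not the paper's algorithm: the greedy coordinate search, the function names, the candidate counts, and `toy_score` are all hypothetical stand-ins chosen for illustration. In a real setting the score would presumably come from re-estimating the grammar and parsing a development set.

```python
from typing import Callable, Dict, List

States = Dict[str, int]  # nonterminal -> number of latent states


def choose_in_isolation(nonterminals: List[str], candidates: List[int],
                        score: Callable[[States], float], default: int) -> States:
    """Pick each nonterminal's count while all others stay at `default`."""
    chosen: States = {}
    for nt in nonterminals:
        base = {n: default for n in nonterminals}
        chosen[nt] = max(candidates, key=lambda m: score({**base, nt: m}))
    return chosen


def choose_globally(nonterminals: List[str], candidates: List[int],
                    score: Callable[[States], float], default: int,
                    max_sweeps: int = 10) -> States:
    """Greedy coordinate search: re-tune each count given the current others."""
    current: States = {n: default for n in nonterminals}
    best = score(current)
    for _ in range(max_sweeps):
        improved = False
        for nt in nonterminals:
            for m in candidates:
                trial = {**current, nt: m}
                s = score(trial)
                if s > best:  # accept only strict improvements
                    current, best, improved = trial, s, True
        if not improved:
            break  # a full sweep changed nothing
    return current


def toy_score(states: States) -> float:
    """Hypothetical stand-in for a dev-set metric with an interaction term."""
    np_, vp = states["NP"], states["VP"]
    return 10 * np_ + 10 * vp - 3 * np_ * vp


nonterminals = ["NP", "VP"]
candidates = [1, 2, 4]
isolated = choose_in_isolation(nonterminals, candidates, toy_score, default=1)
joint = choose_globally(nonterminals, candidates, toy_score, default=1)
print(isolated, toy_score(isolated))
print(joint, toy_score(joint))
```

Because the toy objective couples the two nonterminals, the counts chosen in isolation score lower when combined than the counts found by the joint search. A greedy search like this one can still stop at a local optimum, so it is only one of several possible search strategies.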

Taxonomy

Topics: Speech Recognition and Synthesis · Natural Language Processing Techniques