Extending a Parser to Distant Domains Using a Few Dozen Partially   Annotated Examples

Vidur Joshi; Matthew Peters; Mark Hopkins

arXiv:1805.06556·cs.CL·May 18, 2018

Extending a Parser to Distant Domains Using a Few Dozen Partially Annotated Examples

Vidur Joshi, Matthew Peters, Mark Hopkins

PDF

1 Repo

TL;DR

This paper demonstrates that modern word representations reduce the need for domain adaptation in parsers, and introduces a simple method to adapt parsers to distant domains using only dozens of partially annotated examples, achieving significant improvements.

Contribution

It shows that recent word representations lessen domain adaptation needs and proposes a straightforward adaptation method with minimal annotations for distant domains.

Findings

01

Achieved over 90% F1 on Brown corpus with a parser trained only on Wall Street Journal.

02

Improved geometry-domain parse accuracy from 45% to 73% with about fifty partial annotations.

03

Set a new state-of-the-art single model result on WSJ test set at 94.3% F1.

Abstract

We revisit domain adaptation for parsers in the neural era. First we show that recent advances in word representations greatly diminish the need for domain adaptation when the target domain is syntactically similar to the source domain. As evidence, we train a parser on the Wall Street Jour- nal alone that achieves over 90% F1 on the Brown corpus. For more syntactically dis- tant domains, we provide a simple way to adapt a parser using only dozens of partial annotations. For instance, we increase the percentage of error-free geometry-domain parses in a held-out set from 45% to 73% using approximately five dozen training examples. In the process, we demon- strate a new state-of-the-art single model result on the Wall Street Journal test set of 94.3%. This is an absolute increase of 1.7% over the previous state-of-the-art of 92.6%.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vidurj/parser-adaptation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.