Learning to Embed Words in Context for Syntactic Tasks

Lifu Tu; Kevin Gimpel; Karen Livescu

arXiv:1706.02807·cs.CL·June 13, 2017·2 cites

Learning to Embed Words in Context for Syntactic Tasks

Lifu Tu, Kevin Gimpel, Karen Livescu

PDF

Open Access

TL;DR

This paper introduces context-dependent token embedding models that improve syntactic task performance by capturing word sense and syntactic roles, trained on large unannotated corpora and tested on smaller annotated datasets.

Contribution

It presents simple neural network-based token embedding models that effectively encode context-specific word features for syntactic tasks, demonstrating improved performance over baselines.

Findings

01

Token embeddings outperform baseline models in syntactic tasks.

02

Models trained on large unannotated data improve small-data task performance.

03

Embedding models are efficient and adaptable across different context window sizes.

Abstract

We present models for embedding words in the context of surrounding words. Such models, which we refer to as token embeddings, represent the characteristics of a word that are specific to a given context, such as word sense, syntactic category, and semantic role. We explore simple, efficient token embedding models based on standard neural network architectures. We learn token embeddings on a large amount of unannotated text and evaluate them as features for part-of-speech taggers and dependency parsers trained on much smaller amounts of annotated data. We find that predictors endowed with token embeddings consistently outperform baseline predictors across a range of context window and training set sizes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification