Automatic Selection of Context Configurations for Improved   Class-Specific Word Representations

Ivan Vuli\'c; Roy Schwartz; Ari Rappoport; Roi Reichart; and Anna; Korhonen

arXiv:1608.05528·cs.CL·June 13, 2017

Automatic Selection of Context Configurations for Improved Class-Specific Word Representations

Ivan Vuli\'c, Roy Schwartz, Ari Rappoport, Roi Reichart, and Anna, Korhonen

PDF

TL;DR

This paper presents an automatic framework for selecting class-specific context configurations to improve word representations, demonstrating effectiveness across multiple languages and reducing training time.

Contribution

It introduces a universal dependency-based context configuration space and an efficient search algorithm for optimizing class-specific contexts in word embedding models.

Findings

01

Improves correlation with human similarity scores by 5-6 points.

02

Reduces training data by up to 66%.

03

Generalizes across languages and training setups.

Abstract

This paper is concerned with identifying contexts useful for training word representation models for different word classes such as adjectives (A), verbs (V), and nouns (N). We introduce a simple yet effective framework for an automatic selection of class-specific context configurations. We construct a context configuration space based on universal dependency relations between words, and efficiently search this space with an adapted beam search algorithm. In word similarity tasks for each word class, we show that our framework is both effective and efficient. Particularly, it improves the Spearman's rho correlation with human scores on SimLex-999 over the best previously proposed class-specific contexts by 6 (A), 6 (V) and 5 (N) rho points. With our selected context configurations, we train on only 14% (A), 26.2% (V), and 33.6% (N) of all dependency-based contexts, resulting in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.