English Out-of-Vocabulary Lexical Evaluation Task

Han Wang; Ye Wang; Xinxiang Zhang; Mi Lu; Yoonsuck Choe; Jingjing Cao

arXiv:1804.04242 · cs.CL · May 7, 2019 · 1 citation

Open Access

TL;DR

This paper introduces a novel OOV lexical evaluation task focusing on classifying and predicting attributes of out-of-vocabulary words without prior knowledge, using unsupervised embeddings for baseline experiments.

Contribution

It pioneers an OOV lexical evaluation framework that does not rely on prior knowledge and applies unsupervised embeddings for classification and attribute prediction.

Findings

01 Baseline experiments with Word2Vec and Word2GM demonstrate effectiveness.

02 The task provides a new benchmark for OOV lexical evaluation.

03 Annotator-based attribute inference is feasible without prior knowledge.

Abstract

Unlike previous unknown-noun tagging tasks, this work is the first attempt to focus on out-of-vocabulary (OOV) lexical evaluation tasks that do not require any prior knowledge. OOV words are words that appear only in the test samples. The goal of the tasks is to provide solutions for OOV lexical classification and prediction. The tasks require annotators to infer the attributes of the OOV words from their related contexts. We then use unsupervised word embedding methods such as Word2Vec and Word2GM to run baseline experiments on the categorical classification task and the OOV word attribute prediction tasks.
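The abstract's definition of an OOV word (a word that appears only in test samples) can be illustrated with a minimal sketch. The function names, the whitespace tokenization, the lowercasing, and the toy sentences below are all assumptions made for illustration; they are not taken from the paper and do not represent the authors' pipeline.

```python
def find_oov_words(train_sentences, test_sentences):
    """Return the sorted list of tokens that occur in test but never in train.

    Assumption: tokens are whitespace-separated and compared case-insensitively.
    """
    train_vocab = {tok.lower() for sent in train_sentences for tok in sent.split()}
    test_vocab = {tok.lower() for sent in test_sentences for tok in sent.split()}
    return sorted(test_vocab - train_vocab)


def oov_contexts(test_sentences, oov_words):
    """Map each OOV word to the test sentences in which it occurs."""
    contexts = {word: [] for word in oov_words}
    for sent in test_sentences:
        tokens = {tok.lower() for tok in sent.split()}
        for word in oov_words:
            if word in tokens:
                contexts[word].append(sent)
    return contexts


# Toy data (hypothetical): "zorb" never appears in the training sentences.
train = ["the cat sat on the mat", "a dog chased the cat"]
test = ["the zorb sat on the mat", "a dog chased the zorb"]

oov = find_oov_words(train, test)
ctx = oov_contexts(test, oov)
```

Here `ctx` collects the sentences surrounding each OOV word, which is the kind of context an annotator, or an embedding-based baseline, would have to rely on when no prior knowledge of the word is available.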

Taxonomy

Topics: Natural Language Processing Techniques · Text Readability and Simplification · Topic Modeling