English Out-of-Vocabulary Lexical Evaluation Task
Han Wang, Ye Wang, Xinxiang Zhang, Mi Lu, Yoonsuck Choe, Jingjing Cao

TL;DR
This paper introduces a novel OOV lexical evaluation task focusing on classifying and predicting attributes of out-of-vocabulary words without prior knowledge, using unsupervised embeddings for baseline experiments.
Contribution
It pioneers an OOV lexical evaluation framework that does not rely on prior knowledge and applies unsupervised embeddings for classification and attribute prediction.
Findings
Baseline experiments with Word2Vec and Word2GM demonstrate effectiveness.
The task provides a new benchmark for OOV lexical evaluation.
Annotator-based attribute inference is feasible without prior knowledge.
Abstract
Unlike previous unknown nouns tagging task, this is the first attempt to focus on out-of-vocabulary (OOV) lexical evaluation tasks that do not require any prior knowledge. The OOV words are words that only appear in test samples. The goal of tasks is to provide solutions for OOV lexical classification and prediction. The tasks require annotators to conclude the attributes of the OOV words based on their related contexts. Then, we utilize unsupervised word embedding methods such as Word2Vec and Word2GM to perform the baseline experiments on the categorical classification task and OOV words attribute prediction tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification · Topic Modeling
