Bad Form: Comparing Context-Based and Form-Based Few-Shot Learning in   Distributional Semantic Models

Jeroen Van Hautte; Guy Emerson; Marek Rei

arXiv:1910.00275·cs.CL·October 2, 2019

Bad Form: Comparing Context-Based and Form-Based Few-Shot Learning in Distributional Semantic Models

Jeroen Van Hautte, Guy Emerson, Marek Rei

PDF

TL;DR

This paper compares context-based and form-based few-shot learning methods in distributional semantic models, introduces new evaluation tasks, and demonstrates that hyperparameter tuning significantly improves model performance, setting new benchmarks.

Contribution

It introduces three new tasks for better evaluation of form-based models and highlights the importance of hyperparameter tuning in improving model performance.

Findings

01

Form-based models can leverage word form information in training data.

02

Hyperparameter tuning improves performance across models.

03

Achieved state-of-the-art results on 4 out of 6 tasks.

Abstract

Word embeddings are an essential component in a wide range of natural language processing applications. However, distributional semantic models are known to struggle when only a small number of context sentences are available. Several methods have been proposed to obtain higher-quality vectors for these words, leveraging both this context information and sometimes the word forms themselves through a hybrid approach. We show that the current tasks do not suffice to evaluate models that use word-form information, as such models can easily leverage word forms in the training data that are related to word forms in the test data. We introduce 3 new tasks, allowing for a more balanced comparison between models. Furthermore, we show that hyperparameters that have largely been ignored in previous work can consistently improve the performance of both baseline and advanced models, achieving a new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest