Non-Parametric Few-Shot Learning for Word Sense Disambiguation

Howard Chen, Mengzhou Xia, and Danqi Chen

arXiv:2104.12677 · cs.CL · April 28, 2021

TL;DR

This paper introduces MetricWSD, a non-parametric few-shot learning method for word sense disambiguation. It mitigates data imbalance by using episodic training to transfer knowledge from frequent words to infrequent ones, and it achieves strong results without lexical resources.

Contribution

The paper proposes a non-parametric approach that addresses data imbalance in WSD through episodic training. The approach outperforms parametric models and does not rely on lexical resources.

Findings

01. Achieves a 75.1 F1 score on the WSD benchmark
02. Yields significant improvement for infrequent words and senses
03. Transfers knowledge effectively from high-frequency to low-frequency words

Abstract

Word sense disambiguation (WSD) is a long-standing problem in natural language processing. One significant challenge in supervised all-words WSD is to classify among senses for a majority of words that lie in the long-tail distribution. For instance, 84% of the annotated words have fewer than 10 examples in the SemCor training data. This issue is more pronounced as the imbalance occurs in both word and sense distributions. In this work, we propose MetricWSD, a non-parametric few-shot learning approach to mitigate this data imbalance issue. By learning to compute distances among the senses of a given word through episodic training, MetricWSD transfers knowledge (a learned metric space) from high-frequency words to infrequent ones. MetricWSD constructs the training episodes tailored to word frequencies and explicitly addresses the problem of the skewed distribution, as opposed to mixing…
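
The abstract's idea of classifying by distances among a word's senses can be illustrated with a small sketch. The code below is not the authors' implementation: it assumes a prototypical-network-style setup in which each sense is represented by the mean of its support embeddings and a query is assigned to the nearest prototype under squared Euclidean distance. The function names, the support-size cap, the sense labels, and the hand-made 2-D "embeddings" are all hypothetical, and the learned encoder and its gradient updates are omitted.

```python
import random
from collections import defaultdict


def sample_episode(examples, max_support_per_sense, rng):
    """Split one word's labeled examples into a support set and a query set.

    examples: list of (embedding, sense_label) pairs for a single target word.
    Every sense keeps at least one support example; the cap keeps frequent
    senses from dominating the episode (an assumed balancing rule).
    """
    by_sense = defaultdict(list)
    for emb, sense in examples:
        by_sense[sense].append(emb)

    support, query = {}, []
    for sense, embs in by_sense.items():
        embs = embs[:]          # copy so the caller's list is not shuffled
        rng.shuffle(embs)
        k = max(1, min(max_support_per_sense, len(embs) // 2))
        support[sense] = embs[:k]
        query.extend((e, sense) for e in embs[k:])
    return support, query


def prototypes(support):
    """Represent each sense by the mean of its support embeddings."""
    protos = {}
    for sense, embs in support.items():
        dim = len(embs[0])
        protos[sense] = [sum(e[i] for e in embs) / len(embs) for i in range(dim)]
    return protos


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def predict(query_emb, protos):
    """Non-parametric prediction: pick the sense with the nearest prototype."""
    return min(protos, key=lambda s: sq_dist(query_emb, protos[s]))


# Toy data for one word with a frequent and a rare sense (labels are made up).
examples = [
    ([1.0, 0.0], "finance"), ([0.9, 0.1], "finance"),
    ([0.8, 0.2], "finance"), ([1.0, 0.1], "finance"),
    ([0.0, 1.0], "river"), ([0.1, 0.9], "river"),
]
rng = random.Random(0)
support, query = sample_episode(examples, max_support_per_sense=2, rng=rng)
protos = prototypes(support)
print(predict([0.95, 0.05], protos))
print(predict([0.05, 0.95], protos))
```

In a trained system the embeddings would come from an encoder whose parameters are updated so that query examples land near the prototype of their own sense. Because prediction depends only on distances to support examples, the same learned metric can be reused for words with very few annotations.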

Code & Models

Repositories

princeton-nlp/metric-wsd (PyTorch, official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Speech and dialogue systems