Context-gloss Augmentation for Improving Word Sense Disambiguation

Guan-Ting Lin; Manuel Giambi

arXiv:2110.07174·cs.CL·October 15, 2021·1 cites

Context-gloss Augmentation for Improving Word Sense Disambiguation

Guan-Ting Lin, Manuel Giambi

PDF

Open Access

TL;DR

This paper explores various data augmentation techniques, including back translation and hypernym gloss addition, to enhance BERT-based Word Sense Disambiguation, demonstrating significant performance improvements.

Contribution

It introduces and compares multiple context-gloss augmentation methods, highlighting the effectiveness of back translation and hypernym gloss addition for WSD.

Findings

01

Back translation on gloss yields the best performance.

02

Adding hypernyms' glosses improves WSD accuracy.

03

Sentence-level and word-level augmentations are both effective.

Abstract

The goal of Word Sense Disambiguation (WSD) is to identify the sense of a polysemous word in a specific context. Deep-learning techniques using BERT have achieved very promising results in the field and different methods have been proposed to integrate structured knowledge to enhance performance. At the same time, an increasing number of data augmentation techniques have been proven to be useful for NLP tasks. Building upon previous works leveraging BERT and WordNet knowledge, we explore different data augmentation techniques on context-gloss pairs to improve the performance of WSD. In our experiment, we show that both sentence-level and word-level augmentation methods are effective strategies for WSD. Also, we find out that performance can be improved by adding hypernyms' glosses obtained from a lexical knowledge base. We compare and analyze different context-gloss augmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Layer Normalization · Dense Connections · Softmax · Residual Connection · Attention Dropout