Transfer Learning and Augmentation for Word Sense Disambiguation

Harsh Kohli

arXiv:2101.03617·cs.IR·May 18, 2021

Transfer Learning and Augmentation for Word Sense Disambiguation

Harsh Kohli

PDF

TL;DR

This paper presents a transfer learning and data augmentation pipeline that significantly improves Word Sense Disambiguation performance, achieving state-of-the-art results with a single model comparable to ensemble methods.

Contribution

It introduces a novel combination of transfer learning and data augmentation techniques specifically tailored for Word Sense Disambiguation.

Findings

01

Achieves state-of-the-art single model WSD performance

02

Matches top ensemble results in WSD

03

Demonstrates effectiveness of combined transfer learning and augmentation

Abstract

Many downstream NLP tasks have shown significant improvement through continual pre-training, transfer learning and multi-task learning. State-of-the-art approaches in Word Sense Disambiguation today benefit from some of these approaches in conjunction with information sources such as semantic relationships and gloss definitions contained within WordNet. Our work builds upon these systems and uses data augmentation along with extensive pre-training on various different NLP tasks and datasets. Our transfer learning and augmentation pipeline achieves state-of-the-art single model performance in WSD and is at par with the best ensemble results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.