SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation
Hee Suk Yoon, Eunseop Yoon, John Harvill, Sunjae Yoon, Mark, Hasegawa-Johnson, Chang D. Yoo

TL;DR
This paper introduces SMSMix, a novel data augmentation technique for Word Sense Disambiguation that preserves the sense of target words during sentence mixing, improving recognition of rare senses.
Contribution
It proposes the first sense-preserving mixup method for NLP, enhancing training data for better rare sense recognition in WSD tasks.
Findings
Improves recognition of rare word senses in WSD
Effectively maintains target word sense during augmentation
Enhances training data diversity for better model performance
Abstract
Word Sense Disambiguation (WSD) is an NLP task aimed at determining the correct sense of a word in a sentence from discrete sense choices. Although current systems have attained unprecedented performances for such tasks, the nonuniform distribution of word senses during training generally results in systems performing poorly on rare senses. To this end, we consider data augmentation to increase the frequency of these least frequent senses (LFS) to reduce the distributional bias of senses during training. We propose Sense-Maintained Sentence Mixup (SMSMix), a novel word-level mixup method that maintains the sense of a target word. SMSMix smoothly blends two sentences using mask prediction while preserving the relevant span determined by saliency scores to maintain a specific word's sense. To the best of our knowledge, this is the first attempt to apply mixup in NLP while preserving the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
MethodsMixup
