TL;DR
This paper conducts a large-scale comparison of lexical substitution methods using various language models, demonstrating that target word information injection significantly enhances performance and achieves new state-of-the-art results in word sense induction.
Contribution
It introduces and evaluates new target word injection techniques across multiple models, improving lexical substitution and WSI performance.
Findings
Target word injection improves model performance
Achieved new state-of-the-art results on WSI datasets
Analyzed semantic relation types in generated substitutes
Abstract
Lexical substitution, i.e. generation of plausible words that can replace a particular target word in a given context, is an extremely powerful technology that can be used as a backbone of various NLP applications, including word sense induction and disambiguation, lexical relation extraction, data augmentation, etc. In this paper, we present a large-scale comparative study of lexical substitution methods employing both rather old and most recent language and masked language models (LMs and MLMs), such as context2vec, ELMo, BERT, RoBERTa, XLNet. We show that already competitive results achieved by SOTA LMs/MLMs can be further substantially improved if information about the target word is injected properly. Several existing and new target word injection methods are compared for each LM/MLM using both intrinsic evaluation on lexical substitution datasets and extrinsic evaluation on word…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Tanh Activation · Weight Decay · Layer Normalization · Sigmoid Activation · Attention Dropout · Dropout · Multi-Head Attention
