UoB at SemEval-2020 Task 1: Automatic Identification of Novel Word Senses
Eleri Sarsfield, Harish Tayyar Madabushi

TL;DR
This paper introduces a Bayesian word sense induction method for detecting lexical semantic change, successfully applied to SemEval-2020 Task 1 and Twitter data to identify novel word senses and slang.
Contribution
It presents a novel Bayesian approach for lexical semantic change detection, capable of identifying new word senses and slang in large-scale social media data.
Findings
Effective in SemEval-2020 Task 1
Identifies potential slang words from Twitter data
Demonstrates capability for novel sense detection
Abstract
Much as the social landscape in which languages are spoken shifts, language too evolves to suit the needs of its users. Lexical semantic change analysis is a burgeoning field of semantic analysis which aims to trace changes in the meanings of words over time. This paper presents an approach to lexical semantic change detection based on Bayesian word sense induction suitable for novel word sense identification. This approach is used for a submission to SemEval-2020 Task 1, which shows the approach to be capable of the SemEval task. The same approach is also applied to a corpus gleaned from 15 years of Twitter data, the results of which are then used to identify words which may be instances of slang.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Authorship Attribution and Profiling · Linguistic Variation and Morphology
