TL;DR
This paper introduces a computational framework that models and generates slang by relating conventional and slang meanings, incorporating context, and outperforming existing models in predicting slang usage and emergence over decades.
Contribution
It presents a novel framework combining probabilistic inference and neural contrastive learning for slang generation and understanding, addressing data scarcity and language flexibility.
Findings
Outperforms state-of-the-art language models in slang prediction
Better predicts historical slang emergence from 1960s to 2000s
Semantic space is sensitive to slang-conventional sense similarities
Abstract
Slang is a common type of informal language, but its flexible nature and paucity of data resources present challenges for existing natural language systems. We take an initial step toward machine generation of slang by developing a framework that models the speaker's word choice in slang context. Our framework encodes novel slang meaning by relating the conventional and slang senses of a word while incorporating syntactic and contextual knowledge in slang usage. We construct the framework using a combination of probabilistic inference and neural contrastive learning. We perform rigorous evaluations on three slang dictionaries and show that our approach not only outperforms state-of-the-art language models, but also better predicts the historical emergence of slang word usages from 1960s to 2000s. We interpret the proposed models and find that the contrastively learned semantic space is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
