Probabilistic FastText for Multi-Sense Word Embeddings
Ben Athiwaratkun, Andrew Gordon Wilson, Anima Anandkumar

TL;DR
Probabilistic FastText introduces a Gaussian mixture model for word embeddings that captures multiple senses, sub-word information, and uncertainty, outperforming previous models on various semantic benchmarks.
Contribution
The paper presents Probabilistic FastText, a novel probabilistic model for multi-sense word embeddings that leverages sub-word structures and uncertainty, improving semantic representation especially for rare and unseen words.
Findings
Outperforms FastText and dictionary-based probabilistic embeddings on word similarity benchmarks.
Achieves state-of-the-art results in multi-sense word similarity tasks.
Effectively models rare, misspelt, and unseen words using sub-word information.
Abstract
We introduce Probabilistic FastText, a new model for word embeddings that can capture multiple word senses, sub-word structure, and uncertainty information. In particular, we represent each word with a Gaussian mixture density, where the mean of a mixture component is given by the sum of n-grams. This representation allows the model to share statistical strength across sub-word structures (e.g. Latin roots), producing accurate representations of rare, misspelt, or even unseen words. Moreover, each component of the mixture can capture a different word sense. Probabilistic FastText outperforms both FastText, which has no probabilistic model, and dictionary-level probabilistic embeddings, which do not incorporate subword structures, on several word-similarity benchmarks, including English RareWord and foreign language datasets. We also achieve state-of-art performance on benchmarks that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsfastText
