Exponential Family Embeddings
Maja R. Rudolph, Francisco J. R. Ruiz, Stephan Mandt, David M. Blei

TL;DR
This paper introduces exponential family embeddings, a versatile framework extending word embeddings to various high-dimensional data types, demonstrating improved data reconstruction and structure discovery across neuroscience, market basket, and recommendation systems.
Contribution
The paper develops a general class of embeddings for different data types, with scalable inference algorithms, broadening the applicability of embedding techniques beyond language.
Findings
Outperformed other dimension reduction methods in all applications
Better reconstruction of held-out data
Discovered meaningful qualitative structures
Abstract
Word embeddings are a powerful approach for capturing semantic similarity among terms in a vocabulary. In this paper, we develop exponential family embeddings, a class of methods that extends the idea of word embeddings to other types of high-dimensional data. As examples, we studied neural data with real-valued observations, count data from a market basket analysis, and ratings data from a movie recommendation system. The main idea is to model each observation conditioned on a set of other observations. This set is called the context, and the way the context is defined is a modeling choice that depends on the problem. In language the context is the surrounding words; in neuroscience the context is close-by neurons; in market basket data the context is other items in the shopping cart. Each type of embedding model defines the context, the exponential family of conditional distributions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFamily Dynamics and Relationships
