Context Mover's Distance & Barycenters: Optimal Transport of Contexts for Building Representations
Sidak Pal Singh, Andreas Hug, Aymeric Dieuleveut, Martin Jaggi

TL;DR
This paper introduces a novel unsupervised representation learning framework using optimal transport, modeling entities as probability distributions over contexts, leading to improved performance in semantic tasks.
Contribution
It proposes a new method that leverages Wasserstein distances and barycenters for representing entities as distributions, enhancing interpretability and capturing polysemy.
Findings
Achieved 4.1% relative improvement over Sent2vec and GenSen in semantic tasks.
Demonstrated the effectiveness of distribution-based representations in measuring sentence similarity.
Showcased the interpretability and flexibility of the optimal transport approach.
Abstract
We present a framework for building unsupervised representations of entities and their compositions, where each entity is viewed as a probability distribution rather than a vector embedding. In particular, this distribution is supported over the contexts which co-occur with the entity and are embedded in a suitable low-dimensional space. This enables us to consider representation learning from the perspective of Optimal Transport and take advantage of its tools such as Wasserstein distance and barycenters. We elaborate how the method can be applied for obtaining unsupervised representations of text and illustrate the performance (quantitatively as well as qualitatively) on tasks such as measuring sentence similarity, word entailment and similarity, where we empirically observe significant gains (e.g., 4.1% relative improvement over Sent2vec, GenSen). The key benefits of the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsInterpretability
