A Context-theoretic Framework for Compositionality in Distributional Semantics
Daoud Clarke

TL;DR
This paper introduces a formal framework for compositional distributional semantics where words, phrases, and sentences are represented as vectors, grounded in context and algebraic structures, unifying and generalizing existing vector composition methods.
Contribution
It provides a novel theoretical framework linking vector space models with formal semantics, incorporating algebraic and lattice structures to model meaning and entailment.
Findings
Vectors form an algebra over a field for representing meanings.
Lattice structure models entailment between meanings.
Framework generalizes existing vector composition methods.
Abstract
Techniques in which words are represented as vectors have proved useful in many applications in computational linguistics, however there is currently no general semantic formalism for representing meaning in terms of vectors. We present a framework for natural language semantics in which words, phrases and sentences are all represented as vectors, based on a theoretical analysis which assumes that meaning is determined by context. In the theoretical analysis, we define a corpus model as a mathematical abstraction of a text corpus. The meaning of a string of words is assumed to be a vector representing the contexts in which it occurs in the corpus model. Based on this assumption, we can show that the vector representations of words can be considered as elements of an algebra over a field. We note that in applications of vector spaces to representing meanings of words there is an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Graph Neural Networks
