Towards a Theoretical Understanding of Word and Relation Representation
Carl Allen

TL;DR
This paper provides a theoretical framework explaining how geometric relationships in word and knowledge graph embeddings encode semantic information, enhancing interpretability and understanding of these models.
Contribution
It offers a theoretical justification for the geometric-semantic link in word embeddings and extends this understanding to knowledge graph representations.
Findings
Geometric relationships in embeddings correspond to semantic relations.
Theoretical model links word embeddings and knowledge graph structures.
Improves interpretability of embedding models.
Abstract
Representing words by vectors, or embeddings, enables computational reasoning and is foundational to automating natural language tasks. For example, if word embeddings of similar words contain similar values, word similarity can be readily assessed, whereas judging that from their spelling is often impossible (e.g. cat /feline) and to predetermine and store similarities between all words is prohibitively time-consuming, memory intensive and subjective. We focus on word embeddings learned from text corpora and knowledge graphs. Several well-known algorithms learn word embeddings from text on an unsupervised basis by learning to predict those words that occur around each word, e.g. word2vec and GloVe. Parameters of such word embeddings are known to reflect word co-occurrence statistics, but how they capture semantic meaning has been unclear. Knowledge graph representation models learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
MethodsGloVe Embeddings
