Towards a Theoretical Understanding of Word and Relation Representation

Carl Allen

arXiv:2202.00486·cs.CL·February 2, 2022·1 cites

Towards a Theoretical Understanding of Word and Relation Representation

Carl Allen

PDF

Open Access

TL;DR

This paper provides a theoretical framework explaining how geometric relationships in word and knowledge graph embeddings encode semantic information, enhancing interpretability and understanding of these models.

Contribution

It offers a theoretical justification for the geometric-semantic link in word embeddings and extends this understanding to knowledge graph representations.

Findings

01

Geometric relationships in embeddings correspond to semantic relations.

02

Theoretical model links word embeddings and knowledge graph structures.

03

Improves interpretability of embedding models.

Abstract

Representing words by vectors, or embeddings, enables computational reasoning and is foundational to automating natural language tasks. For example, if word embeddings of similar words contain similar values, word similarity can be readily assessed, whereas judging that from their spelling is often impossible (e.g. cat /feline) and to predetermine and store similarities between all words is prohibitively time-consuming, memory intensive and subjective. We focus on word embeddings learned from text corpora and knowledge graphs. Several well-known algorithms learn word embeddings from text on an unsupervised basis by learning to predict those words that occur around each word, e.g. word2vec and GloVe. Parameters of such word embeddings are known to reflect word co-occurrence statistics, but how they capture semantic meaning has been unclear. Knowledge graph representation models learn…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies

MethodsGloVe Embeddings