Why PairDiff works? -- A Mathematical Analysis of Bilinear Relational   Compositional Operators for Analogy Detection

Huda Hakami; Danushka Bollegala; Hayashi Kohei

arXiv:1709.06673·cs.CL·December 18, 2017

Why PairDiff works? -- A Mathematical Analysis of Bilinear Relational Compositional Operators for Analogy Detection

Huda Hakami, Danushka Bollegala, Hayashi Kohei

PDF

Open Access

TL;DR

This paper provides a theoretical explanation for why the vector offset method (PairDiff) effectively captures semantic relations in word embeddings, showing it as a simplified case of more general bilinear operators under certain conditions.

Contribution

It offers a mathematical analysis demonstrating that PairDiff is a special case of bilinear operators, valid under standardization and uncorrelation of embeddings, supported by empirical verification.

Findings

01

PairDiff is a simplified form of bilinear operators under certain conditions.

02

Empirical verification of uncorrelation assumption across various embeddings.

03

PairDiff performs well on benchmark analogy datasets.

Abstract

Representing the semantic relations that exist between two given words (or entities) is an important first step in a wide-range of NLP applications such as analogical reasoning, knowledge base completion and relational information retrieval. A simple, yet surprisingly accurate method for representing a relation between two words is to compute the vector offset (\PairDiff) between their corresponding word embeddings. Despite the empirical success, it remains unclear as to whether \PairDiff is the best operator for obtaining a relational representation from word embeddings. We conduct a theoretical analysis of generalised bilinear operators that can be used to measure the $ℓ_{2}$ relational distance between two word-pairs. We show that, if the word embeddings are standardised and uncorrelated, such an operator will be independent of bilinear terms, and can be simplified to a linear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies