A Method for Estimating the Proximity of Vector Representation Groups in   Multidimensional Space. On the Example of the Paraphrase Task

Artem Artemov; Boris Alekseev

arXiv:1908.09341·cs.LG·August 30, 2019

A Method for Estimating the Proximity of Vector Representation Groups in Multidimensional Space. On the Example of the Paraphrase Task

Artem Artemov, Boris Alekseev

PDF

Open Access

TL;DR

This paper introduces a novel method for comparing two sets of vectors to measure the semantic similarity of sentences, effectively capturing word order and syntactic nuances without aggregation.

Contribution

It presents a new vector comparison technique based on cosine of angles between projections, improving semantic similarity measurement in sentence comparison tasks.

Findings

01

Method outperforms traditional aggregation-based approaches

02

Effective in capturing word order and syntactic relations

03

Validated on Russian sentence pairs

Abstract

The following paper presents a method of comparing two sets of vectors. The method can be applied in all tasks, where it is necessary to measure the closeness of two objects presented as sets of vectors. It may be applicable when we compare the meanings of two sentences as part of the problem of paraphrasing. This is the problem of measuring semantic similarity of two sentences (group of words). The existing methods are not sensible for the word order or syntactic connections in the considered sentences. The method appears to be advantageous because it neither presents a group of words as one scalar value, nor does it try to show the closeness through an aggregation vector, which is mean for the set of vectors. Instead of that we measure the cosine of the angle as the mean for the first group vectors projections (the context) on one side and each vector of the second group on the other…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Authorship Attribution and Profiling