Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word   Vectors

Vitalii Zhelezniak; Aleksandar Savkov; April Shen; Francesco; Moramarco; Jack Flann; Nils Y. Hammerla

arXiv:1904.13264·cs.CL·May 1, 2019·24 cites

Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors

Vitalii Zhelezniak, Aleksandar Savkov, April Shen, Francesco, Moramarco, Jack Flann, Nils Y. Hammerla

PDF

Open Access 2 Repos

TL;DR

This paper introduces a fuzzy bag-of-words model and DynaMax similarity measure that leverage max-pooling and fuzzy set theory to significantly improve semantic textual similarity tasks, outperforming existing methods.

Contribution

It proposes a novel fuzzy bag-of-words representation and an unsupervised DynaMax similarity measure, advancing the state-of-the-art in semantic textual similarity evaluation.

Findings

01

Outperforms current baselines on STS tasks

02

Fuzzy BoW with max-pooling outperforms cosine similarity

03

DynaMax is efficient and competitive with supervised methods

Abstract

Recent literature suggests that averaged word vectors followed by simple post-processing outperform many deep learning methods on semantic textual similarity tasks. Furthermore, when averaged word vectors are trained supervised on large corpora of paraphrases, they achieve state-of-the-art results on standard STS benchmarks. Inspired by these insights, we push the limits of word embeddings even further. We propose a novel fuzzy bag-of-words (FBoW) representation for text that contains all the words in the vocabulary simultaneously but with different degrees of membership, which are derived from similarities between word vectors. We show that max-pooled word vectors are only a special case of fuzzy BoW and should be compared via fuzzy Jaccard index rather than cosine similarity. Finally, we propose DynaMax, a completely unsupervised and non-parametric similarity measure that dynamically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies