Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors
Vitalii Zhelezniak, Aleksandar Savkov, April Shen, Francesco, Moramarco, Jack Flann, Nils Y. Hammerla

TL;DR
This paper introduces a fuzzy bag-of-words model and DynaMax similarity measure that leverage max-pooling and fuzzy set theory to significantly improve semantic textual similarity tasks, outperforming existing methods.
Contribution
It proposes a novel fuzzy bag-of-words representation and an unsupervised DynaMax similarity measure, advancing the state-of-the-art in semantic textual similarity evaluation.
Findings
Outperforms current baselines on STS tasks
Fuzzy BoW with max-pooling outperforms cosine similarity
DynaMax is efficient and competitive with supervised methods
Abstract
Recent literature suggests that averaged word vectors followed by simple post-processing outperform many deep learning methods on semantic textual similarity tasks. Furthermore, when averaged word vectors are trained supervised on large corpora of paraphrases, they achieve state-of-the-art results on standard STS benchmarks. Inspired by these insights, we push the limits of word embeddings even further. We propose a novel fuzzy bag-of-words (FBoW) representation for text that contains all the words in the vocabulary simultaneously but with different degrees of membership, which are derived from similarities between word vectors. We show that max-pooled word vectors are only a special case of fuzzy BoW and should be compared via fuzzy Jaccard index rather than cosine similarity. Finally, we propose DynaMax, a completely unsupervised and non-parametric similarity measure that dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
