Complex-valued embeddings of generic proximity data
Maximilian M\"unch, Michiel Straat, Michael Biehl and, Frank-Michael Schleif

TL;DR
This paper introduces a complex-valued embedding method for proximity data that preserves information and enables effective machine learning, especially for non-metric and non-psd measures, outperforming traditional techniques.
Contribution
It proposes a novel complex-valued embedding approach for proximity data, addressing limitations of standard embeddings in preserving information and enabling complex-valued machine learning algorithms.
Findings
Strong performance on standard benchmarks
Effective handling of non-metric and non-psd proximity data
Outperforms traditional embedding techniques
Abstract
Proximities are at the heart of almost all machine learning methods. If the input data are given as numerical vectors of equal lengths, euclidean distance, or a Hilbertian inner product is frequently used in modeling algorithms. In a more generic view, objects are compared by a (symmetric) similarity or dissimilarity measure, which may not obey particular mathematical properties. This renders many machine learning methods invalid, leading to convergence problems and the loss of guarantees, like generalization bounds. In many cases, the preferred dissimilarity measure is not metric, like the earth mover distance, or the similarity measure may not be a simple inner product in a Hilbert space but in its generalization a Krein space. If the input data are non-vectorial, like text sequences, proximity-based learning is used or ngram embedding techniques can be applied. Standard embeddings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
