Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs
Denis Mazur, Vage Egiazarian, Stanislav Morozov, Artem Babenko

TL;DR
This paper introduces PRODIGE, a novel method for learning data representations as weighted graphs with shortest path distances, enabling flexible, non-Euclidean geometry modeling that outperforms traditional embedding techniques across various tasks.
Contribution
PRODIGE is the first end-to-end differentiable approach to learn weighted graph representations, removing geometric biases inherent in Euclidean embeddings.
Findings
PRODIGE outperforms existing embedding methods in classification tasks.
The approach effectively models complex data geometries.
Experimental results show improved compression and collaborative filtering performance.
Abstract
Learning useful representations is a key ingredient to the success of modern machine learning. Currently, representation learning mostly relies on embedding data into Euclidean space. However, recent work has shown that data in some domains is better modeled by non-euclidean metric spaces, and inappropriate geometry can result in inferior performance. In this paper, we aim to eliminate the inductive bias imposed by the embedding space geometry. Namely, we propose to map data into more general non-vector metric spaces: a weighted graph with a shortest path distance. By design, such graphs can model arbitrary geometry with a proper configuration of edges and weights. Our main contribution is PRODIGE: a method that learns a weighted graph representation of data end-to-end by gradient descent. Greater generality and fewer model assumptions make PRODIGE more powerful than existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning · Privacy-Preserving Technologies in Data
