# Stochastic Graphlet Embedding

**Authors:** Anjan Dutta, Hichem Sahbi

arXiv: 1702.00156 · 2019-01-01

## TL;DR

This paper introduces a stochastic graphlet embedding method that efficiently maps graphs into vector spaces by sampling high-order graphlets, improving pattern recognition tasks through robust, discriminative representations.

## Contribution

The paper presents a novel stochastic search procedure for extracting high-order graphlets and a hash-based distribution measurement for graph embedding, enhancing graph representation.

## Key findings

- Improved pattern recognition performance on benchmark datasets.
- Efficient sampling of high-order graphlets with low collision probability.
- Enhanced discriminability and robustness of graph embeddings.

## Abstract

Graph-based methods are known to be successful in many machine learning and pattern classification tasks. These methods consider semi-structured data as graphs where nodes correspond to primitives (parts, interest points, segments, etc.) and edges characterize the relationships between these primitives. However, these non-vectorial graph data cannot be straightforwardly plugged into off-the-shelf machine learning algorithms without a preliminary step of -- explicit/implicit -- graph vectorization and embedding. This embedding process should be resilient to intra-class graph variations while being highly discriminant. In this paper, we propose a novel high-order stochastic graphlet embedding (SGE) that maps graphs into vector spaces. Our main contribution includes a new stochastic search procedure that efficiently parses a given graph and extracts/samples unlimitedly high-order graphlets. We consider these graphlets, with increasing orders, to model local primitives as well as their increasingly complex interactions. In order to build our graph representation, we measure the distribution of these graphlets into a given graph, using particular hash functions that efficiently assign sampled graphlets into isomorphic sets with a very low probability of collision. When combined with maximum margin classifiers, these graphlet-based representations have positive impact on the performance of pattern comparison and recognition as corroborated through extensive experiments using standard benchmark databases.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1702.00156/full.md

## Figures

62 figures with captions in the complete paper: https://tomesphere.com/paper/1702.00156/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/1702.00156/full.md

---
Source: https://tomesphere.com/paper/1702.00156