# Towards a property graph generator for benchmarking

**Authors:** Arnau Prat-P\'erez, Joan Guisado-G\'amez, Xavier Fern\'andez Salas,, Petr Koupy, Siegfried Depner, Davide Basilio Bartolini

arXiv: 1704.00630 · 2017-04-04

## TL;DR

This paper presents DataSynth, a flexible framework for generating customizable property graphs that can model complex property-structure correlations, aiding benchmark designers in efficient large-scale graph creation.

## Contribution

Introduction of DataSynth, a novel property graph generator with customizable schemas and correlation modeling, reducing the need for ad-hoc generator development.

## Key findings

- Preliminary promising results for the property-to-node matching algorithm
- Framework supports customizable schemas and characteristics
- Models correlation between properties and graph structure

## Abstract

The use of synthetic graph generators is a common practice among graph-oriented benchmark designers, as it allows obtaining graphs with the required scale and characteristics. However, finding a graph generator that accurately fits the needs of a given benchmark is very difficult, thus practitioners end up creating ad-hoc ones. Such a task is usually time-consuming, and often leads to reinventing the wheel. In this paper, we introduce the conceptual design of DataSynth, a framework for property graphs generation with customizable schemas and characteristics. The goal of DataSynth is to assist benchmark designers in generating graphs efficiently and at scale, saving from implementing their own generators. Additionally, DataSynth introduces novel features barely explored so far, such as modeling the correlation between properties and the structure of the graph. This is achieved by a novel property-to-node matching algorithm for which we present preliminary promising results.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.00630/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1704.00630/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/1704.00630/full.md

---
Source: https://tomesphere.com/paper/1704.00630