Benchmarking data warehouses

J\'er\^ome Darmont (ERIC); Fadila Bentayeb (ERIC); Omar Boussa\"id; (ERIC)

arXiv:1701.00399·cs.DB·January 3, 2017

Benchmarking data warehouses

J\'er\^ome Darmont (ERIC), Fadila Bentayeb (ERIC), Omar Boussa\"id, (ERIC)

PDF

TL;DR

This paper introduces DWEB, a flexible benchmarking tool for data warehouses that enables testing various schemas and workloads to evaluate performance and support design decisions.

Contribution

The paper presents DWEB, a tunable, Java-based benchmark for data warehouses that models different schemas and workloads, addressing limitations of existing benchmarks.

Findings

01

DWEB can generate diverse synthetic data warehouses.

02

DWEB is compatible with most relational databases.

03

Experiments demonstrate DWEB's usefulness for performance assessment.

Abstract

Data warehouse architectural choices and optimization techniques are critical to decision support query performance. To facilitate these choices, the performance of the designed data warehouse must be assessed, usually with benchmarks. These tools can either help system users comparing the performances of different systems, or help system engineers testing the effect of various design choices. While the Transaction Processing Performance Council's standard benchmarks address the first point, they are not tunable enough to address the second one and fail to model different data warehouse schemas. By contrast, our Data Warehouse Engineering Benchmark (DWEB) allows generating various ad-hoc synthetic data warehouses and workloads. DWEB is implemented as a Java free software that can be interfaced with most existing relational database management systems. The full specifications of DWEB, as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.