Ontology-based Design of Experiments on Big Data Solutions
Maximilian Zocholl, Elena Camossi, Anne-Laure Jousselme, Cyril Ray

TL;DR
This paper introduces an ontology-based framework for designing and evaluating complex big data solutions, improving experiment reproducibility, component analysis, and experiment selection to optimize testing processes.
Contribution
It presents a novel ontology-based approach for formalizing and supporting the design of experiments on big data solutions, enabling better decomposition, aggregation, and experiment selection.
Findings
Ontology formalizes big data solution decomposition.
Reduces number of experiments via domain restrictions.
Supports building DoE from scratch with rich descriptions.
Abstract
Big data solutions are designed to cope with data of huge Volume and wide Variety, that need to be ingested at high Velocity and have potential Veracity issues, challenging characteristics that are usually referred to as the "4Vs of Big Data". In order to evaluate possibly complex big data solutions, stress tests require to assess a large number of combinations of sub-components jointly with the possible big data variations. A formalization of the Design of Experiments (DoE) on big data solutions is aimed at ensuring the reproducibility of the experiments, facilitating their partitioning in sub-experiments and guaranteeing the consistency of their outcomes in a global assessment. In this paper, an ontology-based approach is proposed to support the evaluation of a big data system in two ways. Firstly, the approach formalizes a decomposition and recombination of the big data solution,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Testing and Debugging Techniques · Software Engineering Research
