TRANSMUT-SPARK: Transformation Mutation for Apache Spark
Joao Batista de Souza Neto, Anamaria Martins Moreira, Genoveva, Vargas-Solar, Martin A. Musicante

TL;DR
TRANSMUT-Spark is a tool that automates mutation testing for Apache Spark programs, helping developers identify faults in complex Big Data processing code to improve reliability and correctness.
Contribution
The paper introduces TRANSMUT-Spark, a novel tool that automates mutation testing specifically for Spark programs, streamlining fault detection in Big Data applications.
Findings
TRANSMUT-Spark successfully automates mutant generation and testing phases.
Experimental results validate the tool's effectiveness and scope.
The approach improves fault detection in Spark Big Data processing code.
Abstract
We propose TRANSMUT-Spark, a tool that automates the mutation testing process of Big Data processing code within Spark programs. Apache Spark is an engine for Big Data Processing. It hides the complexity inherent to Big Data parallel and distributed programming and processing through built-in functions, underlying parallel processes, and data management strategies. Nonetheless, programmers must cleverly combine these functions within programs and guide the engine to use the right data management strategies to exploit the large number of computational resources required by Big Data processing and avoid substantial production losses. Many programming details in data processing code within Spark programs are prone to false statements that need to be correctly and automatically tested. This paper explores the application of mutation testing in Spark programs, a fault-based testing technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Testing and Debugging Techniques · Cloud Computing and Resource Management
