Rover: An online Spark SQL tuning service via generalized transfer learning
Yu Shen, Xinyuyang Ren, Yupeng Lu, Huaijun Jiang, Huanyong Xu, Di, Peng, Yang Li, Wentao Zhang, Bin Cui

TL;DR
Rover is an online Spark SQL tuning service that leverages generalized transfer learning, combining expert knowledge and controlled history transfer, to efficiently optimize configurations and significantly reduce memory costs in industrial workloads.
Contribution
The paper introduces Rover, a novel online tuning system that integrates expert-assisted Bayesian optimization and controlled transfer learning for improved Spark SQL performance.
Findings
Rover reduces memory costs by 50.1% on average across 12,000 tasks.
76.2% of tasks achieve over 60% memory reduction.
Outperforms existing tuning baselines in efficiency and safety.
Abstract
Distributed data analytic engines like Spark are common choices to process massive data in industry. However, the performance of Spark SQL highly depends on the choice of configurations, where the optimal ones vary with the executed workloads. Among various alternatives for Spark SQL tuning, Bayesian optimization (BO) is a popular framework that finds near-optimal configurations given sufficient budget, but it suffers from the re-optimization issue and is not practical in real production. When applying transfer learning to accelerate the tuning process, we notice two domain-specific challenges: 1) most previous work focus on transferring tuning history, while expert knowledge from Spark engineers is of great potential to improve the tuning performance but is not well studied so far; 2) history tasks should be carefully utilized, where using dissimilar ones lead to a deteriorated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Metabolomics and Mass Spectrometry Studies
Methodstravel james
