On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML
Matthias Boehm, Berthold Reinwald, Dylan Hutchison, Alexandre V., Evfimievski, Prithviraj Sen

TL;DR
This paper presents an optimization framework for operator fusion in large-scale ML systems, improving execution plans by systematically considering various factors to enhance performance significantly.
Contribution
It introduces algorithms for exploring, selecting, and generating fused operator plans that handle complex DAGs, sparsity, and distributed operations in SystemML.
Findings
Up to 21x performance improvement over hand-written fused operators
Effective handling of complex DAGs and hybrid local-distributed plans
Negligible overhead from the optimization process
Abstract
Many large-scale machine learning (ML) systems allow specifying custom ML algorithms by means of linear algebra programs, and then automatically generate efficient execution plans. In this context, optimization opportunities for fused operators---in terms of fused chains of basic operators---are ubiquitous. These opportunities include (1) fewer materialized intermediates, (2) fewer scans of input data, and (3) the exploitation of sparsity across chains of operators. Automatic operator fusion eliminates the need for hand-written fused operators and significantly improves performance for complex or previously unseen chains of operations. However, existing fusion heuristics struggle to find good fusion plans for complex DAGs or hybrid plans of local and distributed operations. In this paper, we introduce an optimization framework for systematically reason about fusion plans that considers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Software Engineering Research · Manufacturing Process and Optimization
