Strategies for Optimizing End-to-End Artificial Intelligence Pipelines on Intel Xeon Processors
Meena Arunachalam, Vrushabh Sanghavi, Yi A Yao, Yi A Zhou, Lifeng A, Wang, Zongru Wen, Niroop Ammbashankar, Ning W Wang, Fahim Mohammad

TL;DR
This paper demonstrates how comprehensive optimization strategies on Intel Xeon processors significantly enhance the performance of various end-to-end AI pipelines across multiple domains.
Contribution
It introduces specific optimization techniques and hardware/software acceleration methods tailored for Intel Xeon processors to improve E2E AI pipeline performance.
Findings
Performance improvements ranged from 1.8x to 81.7x across pipelines.
Optimization strategies are effective for diverse AI tasks including CV, NLP, and recommendation systems.
Leveraging Xeon features enables efficient parallel execution of multiple pipelines.
Abstract
End-to-end (E2E) artificial intelligence (AI) pipelines are composed of several stages including data preprocessing, data ingestion, defining and training the model, hyperparameter optimization, deployment, inference, postprocessing, followed by downstream analyses. To obtain efficient E2E workflow, it is required to optimize almost all the stages of pipeline. Intel Xeon processors come with large memory capacities, bundled with AI acceleration (e.g., Intel Deep Learning Boost), well suited to run multiple instances of training and inference pipelines in parallel and has low total cost of ownership (TCO). To showcase the performance on Xeon processors, we applied comprehensive optimization strategies coupled with software and hardware acceleration on variety of E2E pipelines in the areas of Computer Vision, NLP, Recommendation systems, etc. We were able to achieve a performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Machine Learning and Data Classification
