Strategies for Optimizing End-to-End Artificial Intelligence Pipelines   on Intel Xeon Processors

Meena Arunachalam; Vrushabh Sanghavi; Yi A Yao; Yi A Zhou; Lifeng A; Wang; Zongru Wen; Niroop Ammbashankar; Ning W Wang; Fahim Mohammad

arXiv:2211.00286·cs.LG·November 2, 2022

Strategies for Optimizing End-to-End Artificial Intelligence Pipelines on Intel Xeon Processors

Meena Arunachalam, Vrushabh Sanghavi, Yi A Yao, Yi A Zhou, Lifeng A, Wang, Zongru Wen, Niroop Ammbashankar, Ning W Wang, Fahim Mohammad

PDF

Open Access

TL;DR

This paper demonstrates how comprehensive optimization strategies on Intel Xeon processors significantly enhance the performance of various end-to-end AI pipelines across multiple domains.

Contribution

It introduces specific optimization techniques and hardware/software acceleration methods tailored for Intel Xeon processors to improve E2E AI pipeline performance.

Findings

01

Performance improvements ranged from 1.8x to 81.7x across pipelines.

02

Optimization strategies are effective for diverse AI tasks including CV, NLP, and recommendation systems.

03

Leveraging Xeon features enables efficient parallel execution of multiple pipelines.

Abstract

End-to-end (E2E) artificial intelligence (AI) pipelines are composed of several stages including data preprocessing, data ingestion, defining and training the model, hyperparameter optimization, deployment, inference, postprocessing, followed by downstream analyses. To obtain efficient E2E workflow, it is required to optimize almost all the stages of pipeline. Intel Xeon processors come with large memory capacities, bundled with AI acceleration (e.g., Intel Deep Learning Boost), well suited to run multiple instances of training and inference pipelines in parallel and has low total cost of ownership (TCO). To showcase the performance on Xeon processors, we applied comprehensive optimization strategies coupled with software and hardware acceleration on variety of E2E pipelines in the areas of Computer Vision, NLP, Recommendation systems, etc. We were able to achieve a performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Machine Learning and Data Classification