Benchmarking Data Analysis and Machine Learning Applications on the   Intel KNL Many-Core Processor

Chansup Byun; Jeremy Kepner; William Arcand; David Bestor; Bill; Bergeron; Vijay Gadepally; Michael Houle; Matthew Hubbell; Michael Jones,; Anna Klein; Peter Michaleas; Lauren Milechin; Julie Mullen; Andrew Prout,; Antonio Rosa; Siddharth Samsi; Charles Yee; Albert Reuther

arXiv:1707.03515·cs.PF·March 6, 2018

Benchmarking Data Analysis and Machine Learning Applications on the Intel KNL Many-Core Processor

Chansup Byun, Jeremy Kepner, William Arcand, David Bestor, Bill, Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones,, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout,, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther

PDF

TL;DR

This paper benchmarks data analysis and machine learning applications on the Intel KNL many-core processor, showing significant performance improvements over previous technologies and other Intel CPUs, highlighting KNL's suitability for these workloads.

Contribution

It provides a comprehensive performance evaluation of data analysis and machine learning applications on the Intel KNL processor, demonstrating notable speedups and efficiency gains.

Findings

01

DGEMM performance improved by ~3.5x on KNL

02

Applications achieved ~60% of theoretical peak performance

03

Caffe machine learning application showed 2.7x speedup on KNL

Abstract

Knights Landing (KNL) is the code name for the second-generation Intel Xeon Phi product family. KNL has generated significant interest in the data analysis and machine learning communities because its new many-core architecture targets both of these workloads. The KNL many-core vector processor design enables it to exploit much higher levels of parallelism. At the Lincoln Laboratory Supercomputing Center (LLSC), the majority of users are running data analysis applications such as MATLAB and Octave. More recently, machine learning applications, such as the UC Berkeley Caffe deep learning framework, have become increasingly important to LLSC users. Thus, the performance of these applications on KNL systems is of high interest to LLSC users and the broader data analysis and machine learning communities. Our data analysis benchmarks of these application on the Intel KNL processor indicate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.