Distributed Transfer Learning with 4th Gen Intel Xeon Processors
Lakshmi Arunachalam, Fahim Mohammad, Vrushabh H. Sanghavi

TL;DR
This paper demonstrates that with 4th Gen Intel Xeon processors and advanced matrix extensions, transfer learning can achieve near state-of-the-art image classification accuracy using distributed training, challenging the GPU-centric view.
Contribution
It introduces a novel approach combining Intel Xeon processors and transfer learning for high-accuracy image classification, highlighting CPU capabilities.
Findings
Achieved near state-of-the-art accuracy on image classification dataset.
Showed effective use of Intel AMX for deep learning tasks.
Validated distributed training on CPU architecture.
Abstract
In this paper, we explore how transfer learning, coupled with Intel Xeon, specifically 4th Gen Intel Xeon scalable processor, defies the conventional belief that training is primarily GPU-dependent. We present a case study where we achieved near state-of-the-art accuracy for image classification on a publicly available Image Classification TensorFlow dataset using Intel Advanced Matrix Extensions(AMX) and distributed training with Horovod.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Algorithms
