Using PHAST to port Caffe library: First experiences and lessons learned
Eduardo Jos\'e G\'omez-Hern\'andez, Pablo Antonio Mart\'inez, Biagio, Peccerillo, Sandro Bartolini, Jos\'e Manuel Garc\'ia, Gregorio Bernab\'e

TL;DR
This paper explores using the PHAST library to port the Caffe deep-learning framework across different hardware architectures, highlighting initial experiences, challenges, and performance considerations for achieving performance portability.
Contribution
It demonstrates a practical approach to porting a complex deep-learning framework using PHAST, providing insights and lessons learned for future multi-architecture software development.
Findings
Caffe was successfully ported to CPUs and GPUs using PHAST.
The porting process revealed key challenges and lessons learned.
Initial performance analysis shows potential for performance portability.
Abstract
Performance has always been a hot topic in computing. However, the viable ways to achieve it have taken many forms in the different moments of computing history. Today, technological limits have pushed the adoption of increasingly parallel multi-core and many-core architectures and even the use of highly specific hardware (aka Domain-Specific Architectures, or DSAs) to solve very specific problems. In this new context, one major problem is how to develop software once, and be able to run it on multiple accelerator architectures, seamlessly. Ideally aiming at a single programming model that can automatically target the code to different kinds of parallel architectures, allowing specific tuning with minimal, if any, changes to the source-code in order to seek performance portability. A comprehensive solution to this is still lacking. In this work, we present the use of the PHAST Library,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Embedded Systems Design Techniques
