TL;DR
This paper introduces Phylanx, a toolkit that enables high-level Python and NumPy code to run efficiently in parallel and distributed environments on HPC systems, bridging ease of programming with performance.
Contribution
It presents a novel approach to execute Python and NumPy operations asynchronously on HPC resources using a dependency tree mapped onto the HPX runtime system.
Findings
Achieved parallel execution of machine learning algorithms in Python.
Demonstrated performance comparable to standard NumPy implementations.
Provided debugging and visualization tools for performance analysis.
Abstract
Despite advancements in the areas of parallel and distributed computing, the complexity of programming on High Performance Computing (HPC) resources has deterred many domain experts, especially in the areas of machine learning and artificial intelligence (AI), from utilizing performance benefits of such systems. Researchers and scientists favor high-productivity languages to avoid the inconvenience of programming in low-level languages and costs of acquiring the necessary skills required for programming at this level. In recent years, Python, with the support of linear algebra libraries like NumPy, has gained popularity despite facing limitations which prevent this code from distributed runs. Here we present a solution which maintains both high level programming abstractions as well as parallel and distributed efficiency. Phylanx, is an asynchronous array processing toolkit which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
