Accelerating the Execution of Matrix Languages on the Cell Broadband Engine Architecture
Raymes Khoury, Bernd Burgstaller, Bernhard Scholz

TL;DR
This paper introduces a framework that extends Octave to efficiently utilize the IBM Cell processor's architecture, enabling parallel execution of matrix operations without explicit parallel programming, resulting in significant speedups.
Contribution
It presents a novel extension of Octave with a new matrix data-type that automatically exploits parallelism on the Cell architecture, including data, instruction, pipeline, and task parallelism.
Findings
Achieves up to 12x speedup over Intel Core2 Quad processors.
Effectively exploits multiple parallelism levels in matrix computations.
Validates the approach through extensive experiments.
Abstract
Matrix languages, including MATLAB and Octave, are established standards for applications in science and engineering. They provide interactive programming environments that are easy to use due to their scripting languages with matrix data types. Current implementations of matrix languages do not fully utilise high-performance, special-purpose chip architectures such as the IBM PowerXCell processor (Cell), which is currently used in the fastest computer in the world. We present a new framework that extends Octave to harness the computational power of the Cell. With this framework the programmer is relieved of the burden of introducing explicit notions of parallelism. Instead the programmer uses a new matrix data-type to execute matrix operations in parallel on the synergistic processing elements (SPEs) of the Cell. We employ lazy evaluation semantics for our new matrix data-type to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Interconnection Networks and Systems
