Towards an Efficient Use of the BLAS Library for Multilinear Tensor Contractions
Edoardo Di Napoli (1, 2), Diego Fabregat-Traver (2), Gregorio, Quintana-Ort\`i (3), Paolo Bientinesi (2) ((1) J\"ulich Supercomputing, Centre, Forschungszentrum J\"ulich, (2) AICES, RWTH-Aachen University, (3), Depto. de Ingenier\`ia y Ciencia de Computadores

TL;DR
This paper investigates how to efficiently perform tensor contractions using the BLAS library, specifically identifying when GEMM can be applied and providing guidelines for optimal use based on tensor properties.
Contribution
It establishes conditions for using GEMM in tensor contractions and offers a classification and practical guidelines to improve computational efficiency.
Findings
Conditions for applying GEMM to tensor contractions are defined.
A classification of tensor contractions into three groups is proposed.
Guidelines for effective use of BLAS in tensor computations are provided.
Abstract
Mathematical operators whose transformation rules constitute the building blocks of a multi-linear algebra are widely used in physics and engineering applications where they are very often represented as tensors. In the last century, thanks to the advances in tensor calculus, it was possible to uncover new research fields and make remarkable progress in the existing ones, from electromagnetism to the dynamics of fluids and from the mechanics of rigid bodies to quantum mechanics of many atoms. By now, the formal mathematical and geometrical properties of tensors are well defined and understood; conversely, in the context of scientific and high-performance computing, many tensor- related problems are still open. In this paper, we address the problem of efficiently computing contractions among two tensors of arbitrary dimension by using kernels from the highly optimized BLAS library. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Tensor decomposition and applications · Computational Physics and Python Applications
