A Proposed Framework for Advanced (Multi)Linear Infrastructure in Engineering and Science (FAMLIES)

Devin A. Matthews; Tze Meng Low; Margaret E. Myers; Devangi N. Parikh; Robert A. van de Geijn

arXiv:2604.07311·cs.MS·April 9, 2026

A Proposed Framework for Advanced (Multi)Linear Infrastructure in Engineering and Science (FAMLIES)

Devin A. Matthews, Tze Meng Low, Margaret E. Myers, Devangi N. Parikh, Robert A. van de Geijn

PDF

TL;DR

This paper proposes a flexible, vertically integrated software framework for high-performance linear algebra and tensor computations across diverse hardware architectures, building on prior successful projects.

Contribution

It introduces a new framework that unifies dense linear and multi-linear software stacks, enabling efficient computations from node-level to massively parallel systems.

Findings

01

Framework demonstrates high performance on CPU and GPU architectures.

02

Implementation of key linear algebra and tensor operations showcases flexibility.

03

Lays groundwork for broader scientific and machine learning software integration.

Abstract

We leverage highly successful prior projects sponsored by multiple NSF grants and gifts from industry: the BLAS-like Library Instantiation Software (BLIS) and the libflame efforts to lay the foundation for a new flexible framework by vertically integrating the dense linear and multi-linear (tensor) software stacks that are important to modern computing. This vertical integration will enable high-performance computations from node-level to massively-parallel, and across both CPU and GPU architectures. The effort builds on decades of experience by the research team turning fundamental research on the systematic derivation of algorithms (the NSF-sponsored FLAME project) into practical software for this domain, targeting single and multi-core (BLIS, TBLIS, and libflame), GPU-accelerated (SuperMatrix), and massively parallel (PLAPACK, Elemental, and ROTE) compute environments. This project…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.