Numerical Kernels on a Spatial Accelerator: A Study of Tenstorrent Wormhole

Maya Taylor; Carl Pearson; Luc Berger-Vergiat; Giovanni Long; Jan Ciesko

arXiv:2603.23343·cs.PF·March 25, 2026

Numerical Kernels on a Spatial Accelerator: A Study of Tenstorrent Wormhole

Maya Taylor, Carl Pearson, Luc Berger-Vergiat, Giovanni Long, Jan Ciesko

PDF

Open Access

TL;DR

This paper evaluates Tenstorrent's Wormhole spatial AI accelerator for scientific computing workloads, implementing numerical kernels and a conjugate gradient solver to compare performance with GPUs, highlighting its potential and challenges.

Contribution

It provides the first detailed analysis of numerical kernel performance on a spatial AI architecture, with architecture-specific optimizations and comparative evaluation against GPUs.

Findings

01

AI accelerators can effectively run scientific numerical workloads.

02

Performance gaps and optimization opportunities are identified.

03

Spatial architectures show promise for scientific computing applications.

Abstract

As AI accelerators gain prominence, their potential for traditional scientific computing workloads remains unclear. This paper explores Tenstorrent's Wormhole architecture, a spatial computing platform designed for neural network acceleration, by implementing three numerical kernels and composing them into a conjugate gradient solver. We present architecture-specific optimizations for sparse numerical algorithms, evaluate their performance against Nvidia GPUs, and expose both challenges and opportunities in porting numerical methods to spatial architectures. Our results demonstrate that AI accelerators merit consideration for workloads traditionally dominated by CPUs and GPUs, and more work should be invested in understanding the capabilities of these architectures and making them accessible to the scientific computing community.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Parallel Computing and Optimization Techniques · Model Reduction and Neural Networks