Performance Benchmarking of Tensor Trains for accelerated Quantum-Inspired Homogenization on TPU, GPU and CPU architectures
Sascha H. Hauck, Matthias Kabel, Nicolas R. Gauger

TL;DR
This paper benchmarks tensor train operations on modern hardware accelerators and adapts a quantum-inspired homogenization algorithm to achieve significant speed-ups on GPUs and TPUs for large-scale datasets.
Contribution
It evaluates tensor train operations on CPUs, GPUs, and TPUs and adapts a SFFT-based homogenization algorithm for accelerators, enabling faster processing of high-resolution datasets.
Findings
GPUs and TPUs achieve comparable performance in homogenization tasks.
Speed-ups of up to 10x are achieved on accelerators compared to CPU implementations.
Both architectures show potential for industrial-scale quantum-inspired homogenization applications.
Abstract
Recent advances in high-resolution CT-imaging technology are creating a new class of ultra-high resolved micro-structural datasets that challenge the limits of traditional homogenization approaches. While state-of-the-art FFT-based homogenization techniques remain effective for moderate datasets, their memory footprint and computational cost grow rapidly with increasing resolution, making them increasingly inefficient for industrial-scale problems. To address these challenges, the recently developed Superfast-Fourier Transform (SFFT)-based homogenization algorithm leverages the memory-efficient low-rank representations of Tensor Trains (TTs), which reduce the storage and computational requirements of large-scale homogenization problems. Developed for CPU usage, SFFT-based Homogenization efficiently handles high-resolution datasets, assuming the underlying data is well-behaved. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Electron Microscopy Techniques and Applications · Tensor decomposition and applications · Digital Holography and Microscopy
