Shifting the Sweet Spot: High-Performance Matrix-Free Method for High-Order Elasticity
Dali Chang, Chong Zhang, Kaiqi Zhang, Mingguan Yang, Huiyuan Li, Weiqiang Kong

TL;DR
This paper introduces a highly optimized matrix-free finite element method for high-order elasticity that significantly improves performance and shifts the efficiency 'sweet spot' to higher polynomial degrees on mainstream CPU architectures.
Contribution
The authors develop a novel, optimized matrix-free operator with tensor factorization and macro-kernel fusion, enabling high-order elasticity simulations to run efficiently on modern CPUs.
Findings
Achieves 7x to 83x speedup over baseline
Shifts performance 'sweet spot' to polynomial degree p ≥ 6
Enables large-scale high-order elasticity simulations
Abstract
In high-order finite element analysis for elasticity, matrix-free (PA) methods are a key technology for overcoming the memory bottleneck of traditional Full Assembly (FA). However, existing implementations fail to fully exploit the special structure of modern CPU architectures and tensor-product elements, causing their performance "sweet spot" to anomalously remain at the low order of , which severely limits the potential of high-order methods. To address this challenge, we design and implement a highly optimized PA operator within the MFEM framework, deeply integrated with a Geometric Multigrid (GMG) preconditioner. Our multi-level optimization strategy includes replacing the original generic algorithm with an efficient one based on tensor factorization, exploiting Voigt symmetry to reduce redundant computations for the elasticity problem, and employing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Parallel Computing and Optimization Techniques · Matrix Theory and Algorithms
