QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems   with a Neural-Symbolic Approach

Shouyang Dong; Yuanbo Wen; Jun Bi; Di Huang; Jiaming Guo; Jianxing Xu,; Ruibai Xu; Xinkai Song; Yifan Hao; Xuehai Zhou; Tianshi Chen; Qi Guo; Yunji; Chen

arXiv:2505.02146·cs.CL·May 6, 2025

QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach

Shouyang Dong, Yuanbo Wen, Jun Bi, Di Huang, Jiaming Guo, Jianxing Xu,, Ruibai Xu, Xinkai Song, Yifan Hao, Xuehai Zhou, Tianshi Chen, Qi Guo, Yunji, Chen

PDF

Open Access

TL;DR

QiMeng-Xpiler is a neural-symbolic transcompiler that automatically translates tensor programs across diverse deep learning hardware, significantly improving productivity and performance with high accuracy.

Contribution

It introduces a novel neural-symbolic approach combining LLMs and symbolic synthesis for cross-platform tensor program transcompilation.

Findings

01

Achieves 95% translation accuracy across 4 DLS platforms.

02

Up to 2.0x performance improvement over vendor libraries.

03

Up to 96.0x increase in programming productivity.

Abstract

Heterogeneous deep learning systems (DLS) such as GPUs and ASICs have been widely deployed in industrial data centers, which requires to develop multiple low-level tensor programs for different platforms. An attractive solution to relieve the programming burden is to transcompile the legacy code of one platform to others. However, current transcompilation techniques struggle with either tremendous manual efforts or functional incorrectness, rendering "Write Once, Run Anywhere" of tensor programs an open question. We propose a novel transcompiler, i.e., QiMeng-Xpiler, for automatically translating tensor programs across DLS via both large language models (LLMs) and symbolic program synthesis, i.e., neural-symbolic synthesis. The key insight is leveraging the powerful code generation ability of LLM to make costly search-based symbolic synthesis computationally tractable. Concretely, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Physics and Python Applications · Parallel Computing and Optimization Techniques · Model Reduction and Neural Networks