Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees

Ahmed Heakl; Sarim Hashmi; Chaimaa Abi; Celine Lee; Abdulrahman Mahmoud

arXiv:2506.14606 · cs.CL · June 18, 2025

TL;DR

This paper presents GG (Guaranteed Guess), a CISC-to-RISC transpilation method that pairs LLM-generated translations with software testing to back its correctness claims, and reports better speed, energy, and memory efficiency than Rosetta 2.

Contribution

Introduces GG, an ISA-centric transpilation pipeline that integrates LLMs with testing frameworks to improve correctness and performance in CISC-to-RISC translation.

Findings

01. Achieves 99% correctness on HumanEval programs.

02. Enforces over 98% code coverage in testing.

03. Outperforms Rosetta 2 in speed, energy, and memory efficiency.

Abstract

The hardware ecosystem is rapidly evolving, with increasing interest in translating low-level programs across different instruction set architectures (ISAs) in a quick, flexible, and correct way to enhance the portability and longevity of existing code. A particularly challenging class of this transpilation problem is translating between complex- (CISC) and reduced- (RISC) hardware architectures, due to fundamental differences in instruction complexity, memory models, and execution paradigms. In this work, we introduce GG (Guaranteed Guess), an ISA-centric transpilation pipeline that combines the translation power of pre-trained large language models (LLMs) with the rigor of established software testing constructs. Our method generates candidate translations using an LLM from one ISA to another, and embeds such translations within a software-testing framework to build quantifiable…
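
The generate-then-test idea in the abstract (an LLM proposes candidate translations, and a testing harness decides which one to trust) can be sketched roughly as below. This is an illustrative sketch under stated assumptions, not the authors' implementation: programs are modeled as plain Python callables instead of real source-ISA and target-ISA binaries, and the names `passes_all_tests` and `select_translation` are hypothetical.

```python
from typing import Callable, Iterable, List, Optional, Tuple

# Hypothetical stand-in: a "program" is any callable from int to int.
# In a real pipeline this would be a compiled binary run natively or emulated.
Program = Callable[[int], int]


def passes_all_tests(reference: Program, candidate: Program,
                     inputs: Iterable[int]) -> bool:
    """Return True only if the candidate matches the reference on every input."""
    for x in inputs:
        try:
            if candidate(x) != reference(x):
                return False
        except Exception:
            # A crashing candidate is treated as a failed translation.
            return False
    return True


def select_translation(reference: Program, candidates: List[Program],
                       inputs: Iterable[int]) -> Optional[Tuple[int, Program]]:
    """Return (index, candidate) for the first candidate passing all tests."""
    test_inputs = list(inputs)  # materialize so every candidate sees the same tests
    for index, candidate in enumerate(candidates):
        if passes_all_tests(reference, candidate, test_inputs):
            return index, candidate
    return None  # no candidate earned the testing guarantee


def reference(x: int) -> int:
    # Stand-in for the original (source-ISA) program's behavior.
    return x * x + 1


# Stand-ins for model-generated guesses: the first is subtly wrong.
candidates: List[Program] = [lambda x: x * x, lambda x: x * x + 1]

result = select_translation(reference, candidates, range(-5, 6))
chosen_index = result[0] if result is not None else None
print(chosen_index)
```

The guarantee in such a scheme is only as strong as the test suite, which is why the coverage of the tests matters alongside the pass rate.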

Taxonomy

Topics: Mathematics, Computing, and Information Processing · Natural Language Processing Techniques

Methods: Sparse Evolutionary Training