TL;DR
LLM-Rosetta introduces a hub-and-spoke IR framework enabling seamless, bidirectional translation between diverse LLM API formats, enhancing interoperability and provider neutrality in multi-provider applications.
Contribution
The paper presents a modular, open-source translation system with a shared semantic core that supports multiple API standards, improving portability and reducing integration complexity.
Findings
Achieves lossless round-trip translation fidelity.
Supports bidirectional conversion with low latency.
Successfully deployed in production at Argonne National Laboratory.
Abstract
The rapid proliferation of Large Language Model (LLM) providers--each exposing proprietary API formats--has created a fragmented ecosystem where applications become tightly coupled to individual vendors. Switching or bridging providers requires bilateral adapters, impeding portability and multi-provider architectures. We observe that despite substantial syntactic divergence, the major LLM APIs share a common semantic core: the practical challenge is the combinatorial surface of syntactic variations, not deep semantic incompatibility. Based on this finding, we present LLM-Rosetta, an open-source translation framework built on a hub-and-spoke Intermediate Representation (IR) that captures the shared semantic core--messages, content parts, tool calls, reasoning traces, and generation controls--in a 9-type content model and 10-type stream event schema. A modular Ops-composition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
