Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models

Injae Na; Keonwoong Noh; Woohwan Jung

arXiv:2505.20921·cs.CL·May 30, 2025

Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models

Injae Na, Keonwoong Noh, Woohwan Jung

PDF

Open Access 1 Repo

TL;DR

This paper introduces LLM-AT, an automatic framework that dynamically selects and upgrades LLM tiers for NLP tasks to optimize cost and accuracy without requiring training.

Contribution

The paper presents a novel LLM-AT framework with a tier selection and upgrading mechanism, including an accuracy estimator for initial tier choice, enhancing efficiency and performance.

Findings

01

LLM-AT reduces costs while maintaining high accuracy.

02

The accuracy estimator effectively predicts suitable initial tiers.

03

Experiments show improved performance over static tier selection.

Abstract

LLM providers typically offer multiple LLM tiers, varying in performance and price. As NLP tasks become more complex and modularized, selecting the suitable LLM tier for each subtask is a key challenge to balance between cost and performance. To address the problem, we introduce LLM Automatic Transmission (LLM-AT) framework that automatically selects LLM tiers without training. LLM-AT consists of Starter, Generator, and Judge. The starter selects the initial LLM tier expected to solve the given question, the generator produces a response using the LLM of the selected tier, and the judge evaluates the validity of the response. If the response is invalid, LLM-AT iteratively upgrades to a higher-tier model, generates a new response, and re-evaluates until a valid response is obtained. Additionally, we propose accuracy estimator, which enables the suitable initial LLM tier selection without…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hyudsl/llm-at
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques