AgentTTS: Large Language Model Agent for Test-time Compute-optimal Scaling Strategy in Complex Tasks

Fali Wang; Hui Liu; Zhenwei Dai; Jingying Zeng; Zhiwei Zhang; Zongyu Wu; Chen Luo; Zhen Li; Xianfeng Tang; Qi He; Suhang Wang

arXiv:2508.00890·cs.AI·October 23, 2025

AgentTTS: Large Language Model Agent for Test-time Compute-optimal Scaling Strategy in Complex Tasks

Fali Wang, Hui Liu, Zhenwei Dai, Jingying Zeng, Zhiwei Zhang, Zongyu Wu, Chen Luo, Zhen Li, Xianfeng Tang, Qi He, Suhang Wang

PDF

Open Access

TL;DR

This paper introduces AgentTTS, an LLM-agent framework that optimally allocates compute resources across multi-stage tasks, improving performance and efficiency over traditional methods.

Contribution

The paper presents a novel LLM-agent-based approach for test-time compute-optimal scaling in complex multi-stage tasks, addressing combinatorial search challenges.

Findings

01

AgentTTS outperforms baselines in search efficiency.

02

It demonstrates robustness to varying training set sizes.

03

It offers improved interpretability of allocations.

Abstract

Test-time scaling (TTS) enhances the performance of large language models (LLMs) by allocating additional compute resources during inference. However, existing research primarily investigates TTS in single-stage tasks; while many real-world problems are multi-stage complex tasks, composed of a sequence of heterogeneous subtasks with each subtask requires LLM of specific capability. Therefore, we study a novel problem: the test-time compute-optimal scaling in multi-stage complex tasks, aiming to select suitable models and allocate budgets per subtask to maximize overall performance. TTS in multi-stage tasks introduces two fundamental challenges: (i) The combinatorial search space of model and budget allocations, combined with the high cost of inference, makes brute-force search impractical. (ii) The optimal model and budget allocations across subtasks are interdependent, increasing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques