Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

Yichuan Ma; Linyang Li; Yongkang chen; Peiji Li; Xiaozhe Li; Qipeng Guo; Dahua Lin; Kai Chen

arXiv:2601.16486·cs.CL·January 26, 2026

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

Yichuan Ma, Linyang Li, Yongkang chen, Peiji Li, Xiaozhe Li, Qipeng Guo, Dahua Lin, Kai Chen

PDF

Open Access

TL;DR

This paper introduces Timely Machine, a framework for dynamic, time-aware test-time scaling of language models in agentic scenarios, emphasizing wall-clock time and adaptive strategies.

Contribution

It redefines test-time as wall-clock time, introduces a new benchmark, and proposes reinforcement learning for models to adapt reasoning based on time budgets.

Findings

01

Smaller models perform better with fast feedback and frequent interactions.

02

Larger models excel in high-latency settings due to better interaction quality.

03

Timely-RL improves models' awareness of time constraints and enhances performance.

Abstract

As large language models (LLMs) increasingly tackle complex reasoning tasks, test-time scaling has become critical for enhancing capabilities. However, in agentic scenarios with frequent tool calls, the traditional generation-length-based definition breaks down: tool latency decouples inference time from generation length. We propose Timely Machine, redefining test-time as wall-clock time, where models dynamically adjust strategies based on time budgets. We introduce Timely-Eval, a benchmark spanning high-frequency tool calls, low-frequency tool calls, and time-constrained reasoning. By varying tool latency, we find smaller models excel with fast feedback through more interactions, while larger models dominate high-latency settings via superior interaction quality. Moreover, existing models fail to adapt reasoning to time budgets. We propose Timely-RL to address this gap. After…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Reinforcement Learning in Robotics