Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling
Caleb Winston, Ron Yifeng Wang, Azalia Mirhoseini, Christos Kozyrakis

TL;DR
This paper introduces agent JIT compilation, a method to reduce latency and errors in web automation agents by compiling task descriptions into executable code with integrated planning and scheduling.
Contribution
It presents a novel JIT compilation framework with planning, scheduling, and invariant protocols to improve web agent performance and accuracy.
Findings
10.4× speedup in web application tasks
28% accuracy improvement over Browser-Use
2.4× speedup with 9% accuracy gain over OpenAI CUA
Abstract
Computer-use agents (CUA) automate tasks specified with natural language such as "order the cheapest item from Taco Bell" by generating sequences of calls to tools such as click, type, and scroll on a browser. Current implementations follow a sequential fetch-screenshot-execute loop where each iteration requires an LLM call, resulting in high latency and frequent errors from incorrect tool use. We present agent just-in-time (JIT) compilation, an alternative that compiles task descriptions directly into executable code that is free to include LLM calls, tool calls, and parallelization. Our approach comprises three components: (1) JIT-Planner, which generates multiple code plans, validates each against tool specifications, and selects the minimum-cost candidate; (2) JIT-Scheduler, which explores parallelization strategies via Monte Carlo cost estimation from learned latency distributions;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
