TL;DR
KAIJU introduces a system-level abstraction for LLM agents that decouples reasoning from execution, improving security, parallelism, and behavioral guarantees over traditional prompt-based methods.
Contribution
It proposes an Executive Kernel and Intent-Gated Execution to enhance LLM agent robustness, security, and efficiency, with a novel planning and parallel tool dispatch approach.
Findings
KAIJU achieves better parallel data gathering for complex queries.
The system enforces behavioral guarantees beyond prompt-based methods.
Empirical results show latency trade-offs and advantages at moderate complexity.
Abstract
Tool-calling autonomous agents based on large language models using ReAct exhibit three limitations: serial latency, quadratic context growth, and vulnerability to prompt injection and hallucination. Recent work moves towards separating planning from execution but in each case the model remains coupled to the execution mechanics. We introduce a system-level abstraction for LLM agents which decouples the execution of agent workflows from the LLM reasoning layer. We define two first-class abstractions: (1) Intent-Gated Execution (IGX), a security paradigm that enforces intent at execution, and (2) an Executive Kernel that manages scheduling, tool dispatch, dependency resolution, failures and security. In KAIJU, the LLM plans upfront, optimistically scheduling tools in parallel with dependency-aware parameter injection. Tools are authorised via IGX based on four independent variables:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
