Schema First Tool APIs for LLM Agents: A Controlled Study of Tool Misuse, Recovery, and Budgeted Performance
Akshey Sigdel, Rista Baral

TL;DR
This study investigates how schema-based tool contracts and diagnostics affect the reliability of LLM agents under strict interaction budgets, finding that formalization reduces misuse but does not fully address semantic errors or timeout issues.
Contribution
It provides a controlled experimental comparison of interface design strategies for LLM tools, highlighting the impact of schema formalization on misuse and reliability.
Findings
Schema conditions reduce interface misuse
Success rate remains zero across conditions
Semantic misuse and timeouts are unaffected by schema formalization
Abstract
Tool use has become central to modern LLM agents, yet interface design is rarely isolated as an experimental variable. This paper studies whether schema based tool contracts and structured validation diagnostics improve reliability under strict interaction budgets. We evaluate three conditions that preserve identical tool semantics and information content: free form documentation, JSON Schema specifications, and JSON Schema with structured diagnostics. We implement a deterministic software engineering sandbox with logs, metrics, configurations, and repository tasks, and evaluate a fully crossed pilot with one open local model, three seeds, three interface conditions, and four budgets. We report end task success, interface misuse, execution failures, semantic misuse, recovery behavior, and overhead. In this pilot, success remains zero across conditions, while schema conditions reduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Advanced Software Engineering Methodologies
