TSCG: Deterministic Tool-Schema Compilation for Agentic LLM Deployments
Furkan Sakizli

TL;DR
TSCG is a deterministic compiler that converts JSON tool schemas into token-efficient structured text, significantly improving tool-use accuracy for small language models without fine-tuning.
Contribution
It introduces a formal, composable method for schema compression and interpretation, reducing tool-use failures in agent frameworks for small models.
Findings
Restores 14B model accuracy from 0% to 84.4% on 20 tools.
Achieves 108-181% accuracy-retained ratio across models.
Provides token savings of 52-57% on heavy schemas.
Abstract
Production agent frameworks (OpenAI Function Calling, Anthropic Tool Use, MCP) transmit tool schemas as JSON, a format designed for machine parsing, not for interpretation by language models. For small models (4B-14B), this protocol mismatch accounts for the majority of tool-use failure at production catalog sizes. We present TSCG, a deterministic tool-schema compiler that resolves this mismatch at the API boundary, converting JSON schemas into token-efficient structured text without model access, fine-tuning, or runtime search. TSCG combines eight composable operators with a formal compression bound (>=51% on well-formed schemas). On TSCG-Agentic-Bench (about 19,000 calls, 12 models, 5 scenarios), TSCG restores Phi-4 14B from 0% to 84.4% accuracy at 20 tools (90.3% at 50 tools) and achieves 108-181% accuracy-retained ratio across three models on BFCL. Format-versus-compression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
