Separating Intelligence from Execution: A Workflow Engine for the Model Context Protocol
Abhinav Singh Parmar

TL;DR
The paper introduces the MCP Workflow Engine, a system that separates decision-making from execution in LLM agent tool interactions, significantly reducing token costs and enabling efficient, deterministic orchestration of complex workflows.
Contribution
It presents a novel MCP-native orchestration layer that decouples intelligence from execution, formalizes the MCP Mediator pattern, and demonstrates substantial efficiency gains in a large-scale Kubernetes task.
Findings
Reduces per-execution token cost by over 99%.
Completes complex workflows in under 45 seconds.
Achieves deterministic, idempotent execution with zero agent involvement.
Abstract
Large Language Model (LLM) agents increasingly interact with external systems through tool-calling protocols such as the Model Context Protocol (MCP). In prevailing architectures, the agent must reason about every tool invocation in every session, consuming tokens proportional to the number of actions performed--even when the task has been solved before. We present the MCP Workflow Engine, a novel MCP-native orchestration layer that decouples intelligence (deciding what to do) from execution (carrying it out). An agent reasons once to produce a declarative workflow blueprint--a JSON document specifying a directed sequence of MCP tool calls with parameterized templates, loops, parallel branches, and data piping. Subsequent executions are triggered by a single run_workflow tool call, consuming one invocation's worth of tokens regardless of the blueprint's internal complexity. We formalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
