RunAgent: Interpreting Natural-Language Plans with Constraint-Guided Execution

Arunabh Srivastava; Mohammad A. (Amir) Khojastepour; Srimat Chakradhar; Sennur Ulukus

arXiv:2605.00798·cs.LG·May 4, 2026

RunAgent: Interpreting Natural-Language Plans with Constraint-Guided Execution

Arunabh Srivastava, Mohammad A. (Amir) Khojastepour, Srimat Chakradhar, Sennur Ulukus

PDF

TL;DR

RunAgent is a multi-agent platform that interprets natural-language plans, enforces constraints, and dynamically chooses reasoning methods to improve structured workflow execution accuracy.

Contribution

It introduces an agentic language with explicit control constructs and autonomous constraint validation to enhance LLM-based plan execution reliability.

Findings

01

RunAgent outperforms baseline LLMs on Natural-plan and SciBench datasets.

02

It effectively enforces stepwise constraints and validation during plan execution.

03

The system dynamically switches between reasoning, tool use, and code execution.

Abstract

Humans solve problems by executing targeted plans, yet large language models (LLMs) remain unreliable for structured workflow execution. We propose RunAgent, a multi-agent plan execution platform that interprets natural-language plans while enforcing stepwise execution through constraints and rubrics. RunAgent bridges the expressiveness of natural language with the determinism of programming via an agentic language with explicit control constructs (e.g., \texttt{IF}, \texttt{GOTO}, \texttt{FORALL}). Beyond verifying syntactic and semantic verification of the step output, which is performed based on the specific instruction of each step, RunAgent autonomously derives and validates constraints based on the description of the task and its instance at each step. RunAgent also dynamically selects among LLM-based reasoning, tool usage, and code generation and execution (e.g., in Python), and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.