Natural-Language Agent Harnesses

Linyue Pan; Lexiao Zou; Shuo Guo; Jingchen Ni; Hai-Tao Zheng

arXiv:2603.25723·cs.CL·May 19, 2026

Natural-Language Agent Harnesses

Linyue Pan, Lexiao Zou, Shuo Guo, Jingchen Ni, Hai-Tao Zheng

PDF

2 Datasets

TL;DR

This paper introduces Natural-Language Agent Harnesses (NLAHs) and an Intelligent Harness Runtime (IHR) to represent and execute agent harness policies as editable natural-language documents, improving transparency and reusability.

Contribution

The paper presents a novel approach to modeling agent harnesses as natural-language objects with an interpretable runtime, enabling better inspection, transfer, and analysis.

Findings

01

NLAHs achieve comparable task outcomes to traditional code-based harnesses.

02

IHR-executed NLAHs have shorter static harness policies.

03

Explicit harness modules are shown to be analyzable.

Abstract

Agent performance is strongly shaped by the surrounding harness: the external execution system around a model that organizes a task run. Yet this logic is usually buried in tightly coupled controller code, which makes harnesses hard to inspect, compare, transfer, and ablate. This paper asks whether the reusable design pattern of an agent harness can be represented as an executable natural-language object. We introduce Natural-Language Agent Harnesses (NLAHs), editable documents that describe run-level harness policy, and Intelligent Harness Runtime (IHR), a shared runtime that interprets these documents into agent calls, handoffs, state updates, validation gates, and artifact contracts. Across coding, terminal-use, and computer-use benchmarks, IHR-executed NLAHs achieve comparable task outcomes to code and prompted realizations, while exposing much shorter static harness policies.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Adversarial Robustness in Machine Learning · Logic, programming, and type systems