Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital

T.J. Barton; Chris Constantakis; Patti Hauseman; Annie Mous; Alaska Hoffman; Brian Bergeron; Hunter Goodreau

arXiv:2604.26091·cs.AI·April 30, 2026

Operating-Layer Controls for Onchain Language-Model Agents Under Real Capital

T.J. Barton, Chris Constantakis, Patti Hauseman, Annie Mous, Alaska Hoffman, Brian Bergeron, Hunter Goodreau

PDF

TL;DR

This paper presents a comprehensive operating layer that enhances the reliability of autonomous language-model agents trading real ETH onchain, emphasizing controls, validation, and observability to ensure successful execution.

Contribution

It introduces an operating layer around language models that significantly improves reliability and correctness in real capital trading scenarios, validated through large-scale deployment.

Findings

01

99.9% settlement success rate for policy-valid transactions

02

Reduced fabricated trading rules from 57% to 3% through targeted controls

03

Increased capital deployment from 42.9% to 78.0% after harness improvements

Abstract

We study reliability in autonomous language-model agents that translate user mandates into validated tool actions under real capital. The setting is DX Terminal Pro, a 21-day deployment in which 3,505 user-funded agents traded real ETH in a bounded onchain market. Users configured vaults through structured controls and natural-language strategies, but only agents could choose normal buy/sell trades. The system produced 7.5M agent invocations, roughly 300K onchain actions, about $20M in volume, more than 5,000 ETH deployed, roughly 70B inference tokens, and 99.9% settlement success for policy-valid submitted transactions. Long-running agents accumulated thousands of sequential decisions, including 6,000+ prompt-state-action cycles for continuously active agents, yielding a large-scale trace from user mandate to rendered prompt, reasoning, validation, portfolio state, and settlement.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.