A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP
Xi Yang, Aurelie Lozano, Naoki Abe, Bhavya, Saurabh Jha, Noah Zheutlin, Rohan R. Arora, Yu Deng, Daby M. Sow

TL;DR
This paper introduces a lightweight, model-agnostic framework called DT-MDP-CE that enhances enterprise AI agents by combining digital-twin MDP modeling, contrastive inverse reinforcement learning, and RL-guided context engineering, leading to improved decision-making.
Contribution
The paper presents a novel, integrated framework for offline reinforcement learning that improves enterprise AI agents by modeling reasoning as DT-MDP and estimating reward functions from offline data.
Findings
Significant performance improvements over baseline agents.
Framework generalizes across enterprise AI tasks.
Effective in complex real-world reasoning scenarios.
Abstract
Despite rapid progress in AI agents for enterprise automation and decision-making, their real-world deployment and further performance gains remain constrained by limited data quality and quantity, complex real-world reasoning demands, difficulties with self-play, and the lack of reliable feedback signals. To address these challenges, we propose a lightweight, model-agnostic framework for improving LLM-based enterprise agents via offline reinforcement learning (RL). The proposed Context Engineering via DT-MDP (DT-MDP-CE) framework comprises three key components: (1) A Digital-Twin Markov Decision Process (DT-MDP), which abstracts the agent's reasoning behavior as a finite MDP; (2) A robust contrastive inverse RL, which, armed with the DT-MDP, to efficiently estimate a well-founded reward function and induces policies from mixed-quality offline trajectories; and (3) RL-guided context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Transformation in Industry · Software System Performance and Reliability · IoT and Edge/Fog Computing
