A-LAMP: Agentic LLM-Based Framework for Automated MDP Modeling and Policy Generation

Hong Je-Gal; Chan-Bin Yi; Hyun-Suk Lee

arXiv:2512.11270·cs.AI·December 15, 2025

A-LAMP: Agentic LLM-Based Framework for Automated MDP Modeling and Policy Generation

Hong Je-Gal, Chan-Bin Yi, Hyun-Suk Lee

PDF

Open Access

TL;DR

A-LAMP is a novel framework that automates the translation of natural language task descriptions into formal MDP models and policies using large language models, improving accuracy and reliability in reinforcement learning applications.

Contribution

It introduces an agentic LLM-based pipeline that automates MDP modeling and policy generation, ensuring semantic alignment and high performance across domains.

Findings

01

A-LAMP outperforms single state-of-the-art LLM models in policy generation.

02

Even lightweight variants of A-LAMP approach larger models' performance.

03

The framework maintains task optimality and semantic correctness.

Abstract

Applying reinforcement learning (RL) to real-world tasks requires converting informal descriptions into a formal Markov decision process (MDP), implementing an executable environment, and training a policy agent. Automating this process is challenging due to modeling errors, fragile code, and misaligned objectives, which often impede policy training. We introduce an agentic large language model (LLM)-based framework for automated MDP modeling and policy generation (A-LAMP), that automatically translates free-form natural language task descriptions into an MDP formulation and trained policy. The framework decomposes modeling, coding, and training into verifiable stages, ensuring semantic alignment throughout the pipeline. Across both classic control and custom RL domains, A-LAMP consistently achieves higher policy generation capability than a single state-of-the-art LLM model. Notably,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Topic Modeling · Multimodal Machine Learning Applications