A-LAMP: Agentic LLM-Based Framework for Automated MDP Modeling and Policy Generation
Hong Je-Gal, Chan-Bin Yi, Hyun-Suk Lee

TL;DR
A-LAMP is a novel framework that automates the translation of natural language task descriptions into formal MDP models and policies using large language models, improving accuracy and reliability in reinforcement learning applications.
Contribution
It introduces an agentic LLM-based pipeline that automates MDP modeling and policy generation, ensuring semantic alignment and high performance across domains.
Findings
A-LAMP outperforms single state-of-the-art LLM models in policy generation.
Even lightweight variants of A-LAMP approach larger models' performance.
The framework maintains task optimality and semantic correctness.
Abstract
Applying reinforcement learning (RL) to real-world tasks requires converting informal descriptions into a formal Markov decision process (MDP), implementing an executable environment, and training a policy agent. Automating this process is challenging due to modeling errors, fragile code, and misaligned objectives, which often impede policy training. We introduce an agentic large language model (LLM)-based framework for automated MDP modeling and policy generation (A-LAMP), that automatically translates free-form natural language task descriptions into an MDP formulation and trained policy. The framework decomposes modeling, coding, and training into verifiable stages, ensuring semantic alignment throughout the pipeline. Across both classic control and custom RL domains, A-LAMP consistently achieves higher policy generation capability than a single state-of-the-art LLM model. Notably,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Topic Modeling · Multimodal Machine Learning Applications
