LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning

Chang Yao; Jinghui Qin; Kebing Jin; Hankz Hankui Zhuo

arXiv:2603.01488·cs.AI·March 10, 2026

LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning

Chang Yao, Jinghui Qin, Kebing Jin, Hankz Hankui Zhuo

PDF

Open Access

TL;DR

This paper presents a novel framework leveraging Large Language Models to enhance deep reinforcement learning by enabling semantic skill reuse, real-time constraint monitoring, and improving data efficiency and transferability across environments.

Contribution

The paper introduces an LLM-driven closed-loop framework that maps natural language instructions into executable rules and semantically annotates options for better exploration and transfer in DRL.

Findings

01

Improved data efficiency in DRL tasks.

02

Enhanced cross-environment transferability.

03

Better constraint compliance and interpretability.

Abstract

Despite achieving remarkable success in complex tasks, Deep Reinforcement Learning (DRL) is still suffering from critical issues in practical applications, such as low data efficiency, lack of interpretability, and limited cross-environment transferability. However, the learned policy generating actions based on states are sensitive to the environmental changes, struggling to guarantee behavioral safety and compliance. Recent research shows that integrating Large Language Models (LLMs) with symbolic planning is promising in addressing these challenges. Inspired by this, we introduce a novel LLM-driven closed-loop framework, which enables semantic-driven skill reuse and real-time constraint monitoring by mapping natural language instructions into executable rules and semantically annotating automatically created options. The proposed approach utilizes the general knowledge of LLMs to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Machine Learning in Healthcare