ParMod: A Parallel and Modular Framework for Learning Non-Markovian   Tasks

Ruixuan Miao; Xu Lu; Cong Tian; Bin Yu; Zhenhua Duan

arXiv:2412.12700·cs.LG·December 18, 2024

ParMod: A Parallel and Modular Framework for Learning Non-Markovian Tasks

Ruixuan Miao, Xu Lu, Cong Tian, Bin Yu, Zhenhua Duan

PDF

Open Access

TL;DR

ParMod introduces a parallel and modular reinforcement learning framework designed to efficiently learn non-Markovian tasks specified by temporal logic, addressing challenges of long-term dependencies and reward sparsity.

Contribution

The paper presents a novel framework that modularizes non-Markovian tasks and trains agents in parallel, improving sample efficiency and performance in complex temporal logic-based tasks.

Findings

01

ParMod outperforms existing methods on benchmark problems.

02

The modular approach enhances learning efficiency for non-Markovian tasks.

03

Parallel training accelerates convergence and improves task performance.

Abstract

The commonly used Reinforcement Learning (RL) model, MDPs (Markov Decision Processes), has a basic premise that rewards depend on the current state and action only. However, many real-world tasks are non-Markovian, which has long-term memory and dependency. The reward sparseness problem is further amplified in non-Markovian scenarios. Hence learning a non-Markovian task (NMT) is inherently more difficult than learning a Markovian one. In this paper, we propose a novel \textbf{Par}allel and \textbf{Mod}ular RL framework, ParMod, specifically for learning NMTs specified by temporal logic. With the aid of formal techniques, the NMT is modulaized into a series of sub-tasks based on the automaton structure (equivalent to its temporal logic counterpart). On this basis, sub-tasks will be trained by a group of agents in a parallel fashion, with one agent handling one sub-task. Besides parallel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Data Stream Mining Techniques · Machine Learning and Data Classification