A method for the online construction of the set of states of a Markov   Decision Process using Answer Set Programming

Leonardo A. Ferreira; Reinaldo A. C. Bianchi; Paulo E. Santos; Ramon; Lopez de Mantaras

arXiv:1706.01417·cs.AI·June 6, 2017·2 cites

A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming

Leonardo A. Ferreira, Reinaldo A. C. Bianchi, Paulo E. Santos, Ramon, Lopez de Mantaras

PDF

Open Access

TL;DR

This paper introduces oASP(MDP), a novel method combining Markov Decision Processes and Answer Set Programming to dynamically construct state sets in non-stationary environments, enhancing policy adaptation.

Contribution

The paper proposes a new approach that integrates ASP with MDPs and RL to efficiently handle changing domains during decision-making.

Findings

01

oASP(MDP) effectively constructs state sets in non-stationary environments.

02

The method updates policies without disrupting RL action-value approximations.

03

Results demonstrate improved adaptability in dynamic domains.

Abstract

Non-stationary domains, that change in unpredicted ways, are a challenge for agents searching for optimal policies in sequential decision-making problems. This paper presents a combination of Markov Decision Processes (MDP) with Answer Set Programming (ASP), named {\em Online ASP for MDP} (oASP(MDP)), which is a method capable of constructing the set of domain states while the agent interacts with a changing environment. oASP(MDP) updates previously obtained policies, learnt by means of Reinforcement Learning (RL), using rules that represent the domain changes observed by the agent. These rules represent a set of domain constraints that are processed as ASP programs reducing the search space. Results show that oASP(MDP) is capable of finding solutions for problems in non-stationary domains without interfering with the action-value function approximation process.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, Reasoning, and Knowledge · Multi-Agent Systems and Negotiation · Reinforcement Learning in Robotics