Probabilistic Planning with Partially Ordered Preferences over Temporal Goals

Hazhar Rahmani, Abhishek N. Kulkarni, and Jie Fu

arXiv:2209.12267·cs.RO·March 9, 2023

TL;DR

This paper studies probabilistic planning in MDPs when the user's preferences over temporal goals form a partial order rather than a total order. Preferences are specified with a preference DFA, and planning is cast as computing Pareto-optimal policies.

Contribution

It proposes a preference DFA for specifying partially ordered preferences, translates those preferences into a multi-objective MDP, and proves that the resulting policies are Pareto-optimal.

Findings

01

The planning algorithm handles preferences that form a partial order, not only a total order.

02

The preference DFA specifies the user's preferences over temporally extended goals.

03

The derived policies are Pareto-optimal in the multi-objective MDP.
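Pareto optimality, as used in the findings above, can be illustrated with a small sketch. This is not the paper's algorithm: the policy names and value vectors below are made-up assumptions, and each vector component is assumed to be a value to maximize for one objective (for example, the probability of achieving one goal).

```python
def dominates(u, v):
    """True if vector u is at least as good as v everywhere and strictly better somewhere."""
    return all(x >= y for x, y in zip(u, v)) and any(x > y for x, y in zip(u, v))


def pareto_front(values):
    """Keep the entries whose value vector is not dominated by any other entry."""
    return {
        name: vec
        for name, vec in values.items()
        if not any(dominates(other, vec) for other in values.values())
    }


# Hypothetical policies with made-up two-objective value vectors.
policy_values = {
    "pi1": (0.9, 0.1),
    "pi2": (0.5, 0.5),
    "pi3": (0.4, 0.4),  # dominated by pi2 in both objectives
    "pi4": (0.1, 0.8),
}

front = pareto_front(policy_values)
print(sorted(front))
```

In this toy instance several policies survive the filter because none of them is better than the others on every objective, which is why a multi-objective formulation generally yields a set of Pareto-optimal policies rather than a single optimum.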

Abstract

In this paper, we study planning in stochastic systems, modeled as Markov decision processes (MDPs), with preferences over temporally extended goals. Prior work on temporal planning with preferences assumes that the user preferences form a total order, meaning that every pair of outcomes is comparable. In this work, we consider the case where the preferences over possible outcomes form a partial order rather than a total order. We first introduce a variant of the deterministic finite automaton, referred to as a preference DFA, for specifying the user's preferences over temporally extended goals. Based on order theory, we translate the preference DFA into a preference relation over policies for probabilistic planning in a labeled MDP. In this treatment, a most preferred policy induces a weak-stochastic nondominated probability distribution over the finite paths in the MDP.…
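The distinction the abstract draws between a total order and a partial order can be made concrete with a minimal sketch. The outcome names and the preference pairs below are hypothetical and are not taken from the paper; the code only shows that a partial order may leave some outcomes incomparable and can therefore have more than one most-preferred (nondominated) outcome.

```python
from itertools import product


def transitive_closure(pairs):
    """Return the transitive closure of a set of (better, worse) pairs."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        for (a, b), (c, d) in product(list(closure), repeat=2):
            if b == c and (a, d) not in closure:
                closure.add((a, d))
                changed = True
    return closure


def is_total(outcomes, strict):
    """True if every pair of distinct outcomes is comparable."""
    return all(
        a == b or (a, b) in strict or (b, a) in strict
        for a, b in product(outcomes, repeat=2)
    )


def maximal(outcomes, strict):
    """Outcomes that no other outcome is strictly preferred to."""
    return {o for o in outcomes if not any((p, o) in strict for p in outcomes)}


# Hypothetical outcomes: A and B are each preferred to C, and C to D.
# Nothing is said about A versus B, so they stay incomparable.
outcomes = {"A", "B", "C", "D"}
strict = transitive_closure({("A", "C"), ("B", "C"), ("C", "D")})

print(is_total(outcomes, strict))
print(sorted(maximal(outcomes, strict)))
```

Adding the single pair `("A", "B")` to the input would make the order total and leave `A` as the only maximal outcome, which is the setting assumed by the prior work the abstract mentions.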

Taxonomy

Topics: Formal Methods in Verification · Bayesian Modeling and Causal Inference · Reinforcement Learning in Robotics

Methods: Deterministic Finite Automaton (DFA)