Efficient Algorithms for Planning with Participation Constraints

Hanrui Zhang; Yu Cheng; Vincent Conitzer

arXiv:2205.07767·cs.GT·May 17, 2022

Efficient Algorithms for Planning with Participation Constraints

Hanrui Zhang, Yu Cheng, Vincent Conitzer

PDF

Open Access

TL;DR

This paper introduces the first polynomial-time exact algorithms for planning with participation constraints in finite-horizon Markov decision processes, ensuring the principal's utility while maintaining agent participation.

Contribution

It provides the first polynomial-time exact algorithm for finite-horizon planning with participation constraints, extending to approximate solutions for infinite-horizon cases.

Findings

01

First polynomial-time exact algorithm for finite-horizon case

02

Extension to approximate infinite-horizon algorithms

03

Efficient computation of policies satisfying participation constraints

Abstract

We consider the problem of planning with participation constraints introduced in [Zhang et al., 2022]. In this problem, a principal chooses actions in a Markov decision process, resulting in separate utilities for the principal and the agent. However, the agent can and will choose to end the process whenever his expected onward utility becomes negative. The principal seeks to compute and commit to a policy that maximizes her expected utility, under the constraint that the agent should always want to continue participating. We provide the first polynomial-time exact algorithm for this problem for finite-horizon settings, where previously only an additive $ε$ -approximation algorithm was known. Our approach can also be extended to the (discounted) infinite-horizon case, for which we give an algorithm that runs in time polynomial in the size of the input and $lo g (1/ ε)$ ,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Logic, Reasoning, and Knowledge · Reinforcement Learning in Robotics