Algebraic optimization of sequential decision problems

Mareike Dressler; Marina Garrote-L\'opez; Guido Mont\'ufar; Johannes; M\"uller; Kemal Rose

arXiv:2211.09439·math.OC·November 18, 2022

Algebraic optimization of sequential decision problems

Mareike Dressler, Marina Garrote-L\'opez, Guido Mont\'ufar, Johannes, M\"uller, Kemal Rose

PDF

Open Access 1 Repo

TL;DR

This paper explores algebraic methods for optimizing long-term rewards in partially observable Markov decision processes, providing new bounds and computational techniques for solving the resulting quadratic-constrained linear problems.

Contribution

It introduces an algebraic characterization of the feasible set for state aggregation problems and analyzes the critical points of the optimization problem.

Findings

01

Derived bounds on the number of critical points.

02

Compared algebraic solutions to traditional optimization methods.

03

Validated theoretical bounds through experiments.

Abstract

We study the optimization of the expected long-term reward in finite partially observable Markov decision processes over the set of stationary stochastic policies. In the case of deterministic observations, also known as state aggregation, the problem is equivalent to optimizing a linear objective subject to quadratic constraints. We characterize the feasible set of this problem as the intersection of a product of affine varieties of rank one matrices and a polytope. Based on this description, we obtain bounds on the number of critical points of the optimization problem. Finally, we conduct experiments in which we solve the KKT equations or the Lagrange equations over different boundary components of the feasible set, and compare the result to the theoretical bounds and to other constrained optimization methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

marinagarrote/algebraic-optimization-of-sequential-decision-rules
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReceptor Mechanisms and Signaling · Bayesian Modeling and Causal Inference · Game Theory and Applications