Sequential decision problems, dependent types and generic solutions

Nicola Botta; Patrik Jansson; Cezar Ionescu; David R. Christiansen,; Edwin Brady

arXiv:1610.07145·cs.LO·June 22, 2023·LoG

Sequential decision problems, dependent types and generic solutions

Nicola Botta, Patrik Jansson, Cezar Ionescu, David R. Christiansen,, Edwin Brady

PDF

1 Repo

TL;DR

This paper introduces a formal, computer-verified framework for solving a broad class of finite-horizon sequential decision problems using dependent types, enabling flexible modeling of uncertainties and formal proofs of optimality principles.

Contribution

It provides a generic, dependently-typed implementation for sequential decision problems, including formal proofs of Bellman's principle and backwards induction in Idris and Agda.

Findings

01

Formalization of Bellman's principle of optimality

02

Implementation handles time-dependent control and state spaces

03

Framework applicable to various uncertainty models

Abstract

We present a computer-checked generic implementation for solving finite-horizon sequential decision problems. This is a wide class of problems, including inter-temporal optimizations, knapsack, optimal bracketing, scheduling, etc. The implementation can handle time-step dependent control and state spaces, and monadic representations of uncertainty (such as stochastic, non-deterministic, fuzzy, or combinations thereof). This level of genericity is achievable in a programming language with dependent types (we have used both Idris and Agda). Dependent types are also the means that allow us to obtain a formalization and computer-checked proof of the central component of our implementation: Bellman's principle of optimality and the associated backwards induction algorithm. The formalization clarifies certain aspects of backwards induction and, by making explicit notions such as viability and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nicolabotta/SeqDecProbs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.