An efficient mixed-integer linear programming formulation for solving influence diagrams
Topias Terho, Fabricio Oliveira, Ahti Salo, Pedro Munari

TL;DR
This paper introduces a new, efficient mixed-integer linear programming formulation tailored for complex influence diagrams, enhancing computational performance especially for non-sequential problems with large state spaces.
Contribution
The paper proposes a novel MILP formulation for influence diagrams that outperforms existing models in computational efficiency for challenging problem structures.
Findings
Proposed MILP formulation solves influence diagrams more efficiently.
Significant performance improvements over existing MILP models.
Flexible modeling including conditional value-at-risk and logical constraints.
Abstract
Influence diagrams represent decision-making problems with interdependencies between random events, decisions, and consequences. Traditionally, they have been solved using algorithms that determine the expected utility-maximizing decision strategy. In contrast, state-of-the-art solution approaches convert influence diagrams into a mixed-integer linear programming (MILP) model, which can be solved with powerful off-the-shelf MILP solvers. From a computational standpoint, the existing MILP formulations can be efficiently solved when applied to influence diagrams that represent periodic (or sequential) decision processes, which can be cast as partially observable Markov Decision Processes. However, they are inefficient in problems that lack a periodic structure or if the nodes in the influence diagram have large state spaces, thus limiting their practical use. In this paper, we present an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Reinforcement Learning in Robotics · Risk and Portfolio Optimization
