Utilization of domain knowledge to improve POMDP belief estimation

Tung Nguyen; Johane Takeuchi

arXiv:2302.08748·cs.AI·February 20, 2023

Utilization of domain knowledge to improve POMDP belief estimation

Tung Nguyen, Johane Takeuchi

PDF

Open Access

TL;DR

This paper introduces a novel method that incorporates domain knowledge into POMDP belief updates using Jeffrey's rule, enhancing policy learning efficiency and performance in decision-making under uncertainty.

Contribution

The paper presents a new approach for integrating domain knowledge into POMDP belief estimation via Jeffrey's rule, reducing data needs and improving RL policy performance.

Findings

01

Domain knowledge integration improves belief estimation accuracy.

02

Reduces data requirements for effective POMDP policy learning.

03

Enhances RL policy performance in uncertain environments.

Abstract

The partially observable Markov decision process (POMDP) framework is a common approach for decision making under uncertainty. Recently, multiple studies have shown that by integrating relevant domain knowledge into POMDP belief estimation, we can improve the learned policy's performance. In this study, we propose a novel method for integrating the domain knowledge into probabilistic belief update in POMDP framework using Jeffrey's rule and normalization. We show that the domain knowledge can be utilized to reduce the data requirement and improve performance for POMDP policy learning with RL.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference