Towards Cost Sensitive Decision Making
Yang Li, Junier Oliva

TL;DR
This paper introduces Active-Acquisition POMDPs, enabling RL agents to actively select features to acquire, balancing costs and rewards, and employs a deep generative model for feature imputation to improve decision-making.
Contribution
It proposes a novel Active-Acquisition POMDP framework and hierarchical RL algorithms utilizing deep generative models for feature imputation, advancing decision-making in uncertain environments.
Findings
Outperforms existing POMDP-RL methods in experiments
Effectively balances feature acquisition costs and decision rewards
Uses deep generative models to improve belief estimation
Abstract
Many real-world situations allow for the acquisition of additional relevant information when making decisions with limited or uncertain data. However, traditional RL approaches either require all features to be acquired beforehand (e.g. in a MDP) or regard part of them as missing data that cannot be acquired (e.g. in a POMDP). In this work, we consider RL models that may actively acquire features from the environment to improve the decision quality and certainty, while automatically balancing the cost of feature acquisition process and the reward of task decision process. We propose the Active-Acquisition POMDP and identify two types of the acquisition process for different application domains. In order to assist the agent in the actively-acquired partially-observed environment and alleviate the exploration-exploitation dilemma, we develop a model-based approach, where a deep generative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Complex Systems and Decision Making
