Data-Driven Online Decision Making with Costly Information Acquisition
Onur Atan, Mihaela van der Schaar

TL;DR
This paper introduces algorithms for online decision making that account for the costs of acquiring information, demonstrating sublinear regret and practical benefits in applications like healthcare and finance.
Contribution
It develops two algorithms, Sim-OOS and Seq-OOS, that integrate information acquisition into online learning with proven regret bounds.
Findings
Both algorithms achieve sublinear regret in time.
Substantial performance improvements shown in a breast cancer case study.
Framework applicable to various domains with costly information gathering.
Abstract
In most real-world settings such as recommender systems, finance, and healthcare, collecting useful information is costly and requires an active choice on the part of the decision maker. The decision-maker needs to learn simultaneously what observations to make and what actions to take. This paper incorporates the information acquisition decision into an online learning framework. We propose two different algorithms for this dual learning problem: Sim-OOS and Seq-OOS where observations are made simultaneously and sequentially, respectively. We prove that both algorithms achieve a regret that is sublinear in time. The developed framework and algorithms can be used in many applications including medical informatics, recommender systems and actionable intelligence in transportation, finance, cyber-security etc., in which collecting information prior to making decisions is costly. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Data Stream Mining Techniques
