Inverse Contextual Bandits: Learning How Behavior Evolves over Time

Alihan H\"uy\"uk; Daniel Jarrett; Mihaela van der Schaar

arXiv:2107.06317·cs.LG·June 9, 2022·1 cites

Inverse Contextual Bandits: Learning How Behavior Evolves over Time

Alihan H\"uy\"uk, Daniel Jarrett, Mihaela van der Schaar

PDF

Open Access 2 Repos

TL;DR

This paper introduces Inverse Contextual Bandits, a method to learn and interpret how decision-making policies evolve over time, especially in non-stationary settings like healthcare, using offline data and formal modeling.

Contribution

It formalizes the problem of learning evolving decision policies as Inverse Contextual Bandits and proposes two algorithms for interpretable, non-stationary policy learning.

Findings

01

Effective in modeling evolving behavior in healthcare data

02

Provides interpretable representations of policy changes

03

Validated on real and simulated liver transplantation data

Abstract

Understanding a decision-maker's priorities by observing their behavior is critical for transparency and accountability in decision processes, such as in healthcare. Though conventional approaches to policy learning almost invariably assume stationarity in behavior, this is hardly true in practice: Medical practice is constantly evolving as clinical professionals fine-tune their knowledge over time. For instance, as the medical community's understanding of organ transplantations has progressed over the years, a pertinent question is: How have actual organ allocation policies been evolving? To give an answer, we desire a policy learning method that provides interpretable representations of decision-making, in particular capturing an agent's non-stationary knowledge of the world, as well as operating in an offline manner. First, we model the evolving behavior of decision-makers in terms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Machine Learning in Healthcare