Online Learning with Costly Features in Non-stationary Environments
Saeed Ghoorchian, Evgenii Kortukov, Setareh Maghsudi

TL;DR
This paper introduces a novel approach to online learning in non-stationary environments where features are costly to observe, extending contextual bandits to optimize long-term rewards while balancing observation costs.
Contribution
It proposes a new algorithm for costly feature observation in non-stationary settings, achieving sublinear regret and improving decision-making efficiency.
Findings
The algorithm outperforms existing methods in real-world scenarios.
It effectively balances observation costs and reward maximization.
Demonstrates robustness to abrupt changes in environment dynamics.
Abstract
Maximizing long-term rewards is the primary goal in sequential decision-making problems. The majority of existing methods assume that side information is freely available, enabling the learning agent to observe all features' states before making a decision. In real-world problems, however, collecting beneficial information is often costly. That implies that, besides individual arms' reward, learning the observations of the features' states is essential to improve the decision-making strategy. The problem is aggravated in a non-stationary environment where reward and cost distributions undergo abrupt changes over time. To address the aforementioned dual learning problem, we extend the contextual bandit setting and allow the agent to observe subsets of features' states. The objective is to maximize the long-term average gain, which is the difference between the accumulated rewards and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Data Stream Mining Techniques
