Neural Risk-sensitive Satisficing in Contextual Bandits

Shogo Ito; Tatsuji Takahashi; Yu Kono

arXiv:2501.08612·cs.LG·January 16, 2025

Neural Risk-sensitive Satisficing in Contextual Bandits

Shogo Ito, Tatsuji Takahashi, Yu Kono

PDF

Open Access

TL;DR

This paper introduces NeuralRS, a neural network-based algorithm for contextual bandits that improves upon previous linear methods by handling complex, non-linear reward relationships in recommendation systems.

Contribution

It extends the RegLinRS algorithm by integrating neural networks, enabling better performance in environments with non-linear feature-reward relationships.

Findings

01

NeuralRS effectively models non-linear reward functions.

02

NeuralRS outperforms linear methods in complex environments.

03

The approach demonstrates improved adaptability in recommendation tasks.

Abstract

The contextual bandit problem, which is a type of reinforcement learning tasks, provides an effective framework for solving challenges in recommendation systems, such as satisfying real-time requirements, enabling personalization, addressing cold-start problems. However, contextual bandit algorithms face challenges since they need to handle large state-action spaces sequentially. These challenges include the high costs for learning and balancing exploration and exploitation, as well as large variations in performance that depend on the domain of application. To address these challenges, Tsuboya et~al. proposed the Regional Linear Risk-sensitive Satisficing (RegLinRS) algorithm. RegLinRS switches between exploration and exploitation based on how well the agent has achieved the target. However, the reward expectations in RegLinRS are linearly approximated based on features, which limits…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDecision-Making and Behavioral Economics · Neural and Behavioral Psychology Studies