Combinatorial Pure Exploration with Full-bandit Feedback and Beyond:   Solving Combinatorial Optimization under Uncertainty with Limited Observation

Yuko Kuroki; Junya Honda; Masashi Sugiyama

arXiv:2012.15584·cs.LG·August 30, 2023

Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation

Yuko Kuroki, Junya Honda, Masashi Sugiyama

PDF

Open Access

TL;DR

This paper reviews recent techniques for solving combinatorial optimization problems under uncertainty with limited feedback, addressing practical constraints like privacy and budget limits in applications such as recommender systems and networks.

Contribution

It provides a comprehensive review of recent methods for combinatorial pure exploration with limited feedback, extending beyond traditional semi-bandit feedback assumptions.

Findings

01

Summarizes recent algorithms for limited feedback scenarios.

02

Highlights challenges and solutions in practical applications.

03

Identifies open problems and future research directions.

Abstract

Combinatorial optimization is one of the fundamental research fields that has been extensively studied in theoretical computer science and operations research. When developing an algorithm for combinatorial optimization, it is commonly assumed that parameters such as edge weights are exactly known as inputs. However, this assumption may not be fulfilled since input parameters are often uncertain or initially unknown in many applications such as recommender systems, crowdsourcing, communication networks, and online advertisement. To resolve such uncertainty, the problem of combinatorial pure exploration of multi-armed bandits (CPE) and its variants have recieved increasing attention. Earlier work on CPE has studied the semi-bandit feedback or assumed that the outcome from each individual edge is always accessible at all rounds. However, due to practical constraints such as a budget…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Auction Theory and Applications

MethodsCollaborative Preference Embedding