Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation
Yuko Kuroki, Junya Honda, Masashi Sugiyama

TL;DR
This paper reviews recent techniques for solving combinatorial optimization problems under uncertainty with limited feedback, addressing practical constraints like privacy and budget limits in applications such as recommender systems and networks.
Contribution
It provides a comprehensive review of recent methods for combinatorial pure exploration with limited feedback, extending beyond traditional semi-bandit feedback assumptions.
Findings
Summarizes recent algorithms for limited feedback scenarios.
Highlights challenges and solutions in practical applications.
Identifies open problems and future research directions.
Abstract
Combinatorial optimization is one of the fundamental research fields that has been extensively studied in theoretical computer science and operations research. When developing an algorithm for combinatorial optimization, it is commonly assumed that parameters such as edge weights are exactly known as inputs. However, this assumption may not be fulfilled since input parameters are often uncertain or initially unknown in many applications such as recommender systems, crowdsourcing, communication networks, and online advertisement. To resolve such uncertainty, the problem of combinatorial pure exploration of multi-armed bandits (CPE) and its variants have recieved increasing attention. Earlier work on CPE has studied the semi-bandit feedback or assumed that the outcome from each individual edge is always accessible at all rounds. However, due to practical constraints such as a budget…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Auction Theory and Applications
MethodsCollaborative Preference Embedding
