Using Subjective Logic to Estimate Uncertainty in Multi-Armed Bandit Problems

Fabio Massimo Zennaro; Audun Jøsang

arXiv:2008.07386 · cs.LG · August 18, 2020 · 6 citations

TL;DR

This paper applies subjective logic to estimate and manage uncertainty in multi-armed bandit problems, distinguishing the randomness inherent to the system (aleatoric uncertainty) from the uncertainty that stems from the agent's limited knowledge (epistemic uncertainty).

Contribution

The paper proposes new multi-armed bandit algorithms grounded in subjective logic, compares them against classical algorithms from the literature, and examines how uncertainty is evaluated.

Findings

01. Subjective logic enables effective uncertainty assessment.

02. New algorithms outperform classical methods in certain scenarios.

03. Insights into the dynamics of epistemic and aleatoric uncertainty.

Abstract

The multi-armed bandit problem is a classical decision-making problem where an agent has to learn an optimal action balancing exploration and exploitation. Properly managing this trade-off requires a correct assessment of uncertainty; in multi-armed bandits, as in other machine learning applications, it is important to distinguish between stochasticity that is inherent to the system (aleatoric uncertainty) and stochasticity that derives from the limited knowledge of the agent (epistemic uncertainty). In this paper we consider the formalism of subjective logic, a concise and expressive framework to express Dirichlet-multinomial models as subjective opinions, and we apply it to the problem of multi-armed bandits. We propose new algorithms grounded in subjective logic to tackle the multi-armed bandit problem, we compare them against classical algorithms from the literature, and we analyze…
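
To make the link between evidence counts and subjective opinions concrete, here is a minimal sketch for a Bernoulli bandit. It uses the standard subjective-logic mapping from observed successes and failures to a binomial opinion (belief, disbelief, uncertainty, base rate) with non-informative prior weight W = 2. The names `Opinion`, `opinion_from_evidence`, `select_arm`, and `run`, as well as the optimistic selection rule with its `bonus` parameter, are illustrative assumptions; they are not the algorithms proposed in the paper, which this page does not spell out.

```python
import random
from dataclasses import dataclass

W = 2.0  # non-informative prior weight, the usual default in subjective logic


@dataclass
class Opinion:
    """Binomial subjective opinion about 'this arm pays a reward'."""
    belief: float
    disbelief: float
    uncertainty: float  # epistemic part: shrinks as evidence accumulates
    base_rate: float = 0.5

    def projected(self) -> float:
        # Projected probability P = b + a * u
        return self.belief + self.base_rate * self.uncertainty


def opinion_from_evidence(successes, failures, base_rate=0.5):
    """Map evidence counts (r, s) to an opinion: b = r/(r+s+W), d = s/(r+s+W), u = W/(r+s+W)."""
    total = successes + failures + W
    return Opinion(successes / total, failures / total, W / total, base_rate)


def select_arm(counts, bonus=1.0):
    """Hypothetical rule (an assumption, not the paper's): maximize P + bonus * u."""
    scores = []
    for successes, failures in counts:
        op = opinion_from_evidence(successes, failures)
        scores.append(op.projected() + bonus * op.uncertainty)
    return max(range(len(scores)), key=scores.__getitem__)


def run(probs, steps=1000, seed=0):
    """Simulate a Bernoulli bandit; return per-arm [successes, failures]."""
    rng = random.Random(seed)
    counts = [[0, 0] for _ in probs]
    for _ in range(steps):
        arm = select_arm(counts)
        rewarded = rng.random() < probs[arm]
        counts[arm][0 if rewarded else 1] += 1
    return counts


if __name__ == "__main__":
    for arm, (r, s) in enumerate(run([0.2, 0.5, 0.8])):
        print(arm, r, s, opinion_from_evidence(r, s))
```

With a base rate of 0.5 and W = 2, the projected probability equals the posterior mean of a Beta(r + 1, s + 1) distribution, so the opinion carries the same information as the Beta model while exposing the uncertainty mass u as a separate, directly readable quantity.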

Code & Models

Repositories

FMZennaro/SLBandits (official)

Taxonomy

Topics: Advanced Bandit Algorithms Research · Data Stream Mining Techniques · Machine Learning and Algorithms