Active Inference in Contextual Multi-Armed Bandits for Autonomous Robotic Exploration

Shohei Wakayama; Alberto Candela; Paul Hayne; Nisar Ahmed

arXiv:2408.04119·cs.RO·June 10, 2025

Active Inference in Contextual Multi-Armed Bandits for Autonomous Robotic Exploration

Shohei Wakayama, Alberto Candela, Paul Hayne, Nisar Ahmed

PDF

Open Access

TL;DR

This paper demonstrates that active inference effectively balances exploration and exploitation in contextual multi-armed bandit problems for autonomous robotic exploration, especially in realistic noisy environments, outperforming standard methods.

Contribution

It applies neuro-inspired active inference to real-world scenarios, specifically mineralogical survey site selection, showing improved efficiency and adaptability over traditional bandit strategies.

Findings

01

Active inference requires fewer iterations than standard bandit approaches.

02

It adapts effectively to changing expert preferences.

03

Performance is robust in noisy, biased real-world data.

Abstract

Autonomous selection of optimal options for data collection from multiple alternatives is challenging in uncertain environments. When secondary information about options is accessible, such problems can be framed as contextual multi-armed bandits (CMABs). Neuro-inspired active inference has gained interest for its ability to balance exploration and exploitation using the expected free energy objective function. Unlike previous studies that showed the effectiveness of active inference based strategy for CMABs using synthetic data, this study aims to apply active inference to realistic scenarios, using a simulated mineralogical survey site selection problem. Hyperspectral data from AVIRIS-NG at Cuprite, Nevada, serves as contextual information for predicting outcome probabilities, while geologists' mineral labels represent outcomes. Monte Carlo simulations assess the robustness of active…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Data Stream Mining Techniques