Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling

Walid Bendada; Guillaume Salha-Galvan; Romain Hennequin; Th\'eo Bontempelli; Thomas Bouab\c{c}a; Tristan Cazenave

arXiv:2507.00518·cs.LG·July 2, 2025

Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling

Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Th\'eo Bontempelli, Thomas Bouab\c{c}a, Tristan Cazenave

PDF

Open Access

TL;DR

This paper presents vMF-exp, a scalable exploration method for large action sets in reinforcement learning using hyperspherical embeddings and von Mises-Fisher sampling, outperforming traditional methods in scalability and effectiveness.

Contribution

The paper introduces vMF-exp, a novel scalable exploration technique leveraging von Mises-Fisher distributions for hyperspherical action embeddings, addressing scalability issues of existing methods.

Findings

01

vMF-exp scales to virtually unlimited actions

02

Theoretically maintains exploration probabilities similar to Boltzmann Exploration

03

Empirically effective in simulated, real-world, and large-scale deployment scenarios

Abstract

This paper introduces von Mises-Fisher exploration (vMF-exp), a scalable method for exploring large action sets in reinforcement learning problems where hyperspherical embedding vectors represent these actions. vMF-exp involves initially sampling a state embedding representation using a von Mises-Fisher distribution, then exploring this representation's nearest neighbors, which scales to virtually unlimited numbers of candidate actions. We show that, under theoretical assumptions, vMF-exp asymptotically maintains the same probability of exploring each action as Boltzmann Exploration (B-exp), a popular alternative that, nonetheless, suffers from scalability issues as it requires computing softmax values for each action. Consequently, vMF-exp serves as a scalable alternative to B-exp for exploring large action sets with hyperspherical embeddings. Experiments on simulated data, real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Complex Network Analysis Techniques · Generative Adversarial Networks and Image Synthesis