Bregman Centroid Guided Cross-Entropy Method

Yuliang Gu; Hongpeng Cao; Marco Caccamo; Naira Hovakimyan

arXiv:2506.02205 · cs.LG · July 2, 2025

TL;DR

This paper introduces Bregman Centroid Guided CEM, an enhancement to the Cross-Entropy Method that improves convergence and solution quality in multimodal optimization tasks within model-based reinforcement learning.

Contribution

It proposes a novel Bregman centroid approach for ensemble CEM, enabling better diversity and information aggregation with minimal computational overhead.

Findings

01. Improves convergence speed on synthetic benchmarks.

02. Enhances solution quality in navigation tasks.

03. Integrates seamlessly into existing CEM pipelines.

Abstract

The Cross-Entropy Method (CEM) is a widely adopted trajectory optimizer in model-based reinforcement learning (MBRL), but its unimodal sampling strategy often leads to premature convergence in multimodal landscapes. In this work, we propose Bregman Centroid Guided CEM ($\mathcal{BC}$-EvoCEM), a lightweight enhancement to ensemble CEM that leverages Bregman centroids for principled information aggregation and diversity control. $\mathcal{BC}$-EvoCEM computes a performance-weighted Bregman centroid across CEM workers and updates the least contributing ones by sampling within a trust region around the centroid. Leveraging the duality between Bregman divergences and exponential family distributions, we show that $\mathcal{BC}$-EvoCEM integrates seamlessly into standard CEM pipelines with negligible overhead. Empirical results on synthetic benchmarks, a…
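The centroid-and-reset step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it makes several assumptions the abstract does not confirm: each CEM worker holds a diagonal Gaussian, higher scores are better, performance weights come from a softmax over scores, the "least contributing" worker is simply the lowest-scoring one, and the trust region is a box of half-width `radius` standard deviations around the centroid mean. All function names and parameters here are hypothetical. The one standard fact used is that, for an exponential family, the weighted centroid minimizing $\sum_i w_i \,\mathrm{KL}(p_i \,\|\, q)$ is the weighted average of the expectation parameters, which for Gaussians is moment matching.

```python
import numpy as np


def softmax_weights(scores, temperature=1.0):
    """Performance weights (assumption: softmax over scores, higher is better)."""
    s = np.asarray(scores, dtype=float) / temperature
    s = s - s.max()  # shift for numerical stability
    e = np.exp(s)
    return e / e.sum()


def bregman_centroid_diag_gauss(means, variances, weights):
    """Weighted centroid of diagonal Gaussians in expectation parameters.

    Averages the first and second moments, which minimizes
    sum_i w_i * KL(N_i || N_c) over diagonal Gaussians N_c.
    """
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)
    mu_c = w @ means                         # weighted first moment
    second = w @ (variances + means ** 2)    # weighted second moment
    var_c = second - mu_c ** 2               # non-negative by Jensen's inequality
    return mu_c, var_c


def reset_worst_worker(means, variances, scores, radius=1.0, rng=None):
    """Re-initialize the lowest-scoring worker near the centroid.

    Hypothetical trust region: a uniform draw from a box of half-width
    radius * std around the centroid mean.
    """
    rng = np.random.default_rng() if rng is None else rng
    means = np.array(means, dtype=float)          # copies, inputs untouched
    variances = np.array(variances, dtype=float)
    weights = softmax_weights(scores)
    mu_c, var_c = bregman_centroid_diag_gauss(means, variances, weights)
    worst = int(np.argmin(scores))                # stand-in for "least contributing"
    step = rng.uniform(-1.0, 1.0, size=mu_c.shape) * radius * np.sqrt(var_c)
    means[worst] = mu_c + step
    variances[worst] = var_c
    return means, variances, worst
```

In an ensemble loop, a call such as `reset_worst_worker(means, variances, scores, radius=0.5)` would sit between ordinary per-worker CEM updates. The paper's actual contribution measure and trust-region definition may differ from these stand-ins.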

Taxonomy

Topics: Infrared Target Detection Methodologies · Thermography and Photoacoustic Techniques · Image and Signal Denoising Methods