Bregman Centroid Guided Cross-Entropy Method
Yuliang Gu, Hongpeng Cao, Marco Caccamo, Naira Hovakimyan

TL;DR
This paper introduces Bregman Centroid Guided CEM, an enhancement to the Cross-Entropy Method that improves convergence and solution quality in multimodal optimization tasks within model-based reinforcement learning.
Contribution
It proposes a novel Bregman centroid approach for ensemble CEM, enabling better diversity and information aggregation with minimal computational overhead.
Findings
Improves convergence speed in synthetic benchmarks.
Enhances solution quality in navigation tasks.
Seamless integration into existing CEM pipelines.
Abstract
The Cross-Entropy Method (CEM) is a widely adopted trajectory optimizer in model-based reinforcement learning (MBRL), but its unimodal sampling strategy often leads to premature convergence in multimodal landscapes. In this work, we propose Bregman Centroid Guided CEM (-EvoCEM), a lightweight enhancement to ensemble CEM that leverages for principled information aggregation and diversity control. \textbf{\mathcal{BC}-EvoCEM} computes a performance-weighted Bregman centroid across CEM workers and updates the least contributing ones by sampling within a trust region around the centroid. Leveraging the duality between Bregman divergences and exponential family distributions, we show that \textbf{\mathcal{BC}-EvoCEM} integrates seamlessly into standard CEM pipelines with negligible overhead. Empirical results on synthetic benchmarks, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · Thermography and Photoacoustic Techniques · Image and Signal Denoising Methods
