Hamiltonian Monte Carlo Particle Swarm Optimizer

Omatharv Bharat Vaidya (1); Rithvik Terence DSouza (1); Snehanshu Saha; (1); Soma Dhavala (2); Swagatam Das (3); ((1)-BITS Pilani K K Birla Goa; Campus; (2)-MlSqaure Bangalore; (3)- ISI Kolkata)

arXiv:2206.14134·cs.LG·June 29, 2022·1 cites

Hamiltonian Monte Carlo Particle Swarm Optimizer

Omatharv Bharat Vaidya (1), Rithvik Terence DSouza (1), Snehanshu Saha, (1), Soma Dhavala (2), Swagatam Das (3), ((1)-BITS Pilani K K Birla Goa, Campus, (2)-MlSqaure Bangalore, (3)- ISI Kolkata)

PDF

Open Access

TL;DR

The paper presents HMC-PSO, a novel optimization algorithm combining Hamiltonian dynamics with particle swarm principles, enhancing exploration and exploitation especially for non-convex functions and neural network training.

Contribution

It introduces HMC-PSO, integrating Hamiltonian Monte Carlo with PSO, and extends it to approximate gradients for deep neural network optimization.

Findings

01

HMC-PSO effectively explores non-convex functions.

02

It outperforms some state-of-the-art optimizers on benchmark tasks.

03

The method provides a new approach to gradient approximation in DNNs.

Abstract

We introduce the Hamiltonian Monte Carlo Particle Swarm Optimizer (HMC-PSO), an optimization algorithm that reaps the benefits of both Exponentially Averaged Momentum PSO and HMC sampling. The coupling of the position and velocity of each particle with Hamiltonian dynamics in the simulation allows for extensive freedom for exploration and exploitation of the search space. It also provides an excellent technique to explore highly non-convex functions while ensuring efficient sampling. We extend the method to approximate error gradients in closed form for Deep Neural Network (DNN) settings. We discuss possible methods of coupling and compare its performance to that of state-of-the-art optimizers on the Golomb's Ruler problem and Classification tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Stochastic Gradient Optimization Techniques