Hamiltonian Monte Carlo Particle Swarm Optimizer
Omatharv Bharat Vaidya (1), Rithvik Terence DSouza (1), Snehanshu Saha, (1), Soma Dhavala (2), Swagatam Das (3), ((1)-BITS Pilani K K Birla Goa, Campus, (2)-MlSqaure Bangalore, (3)- ISI Kolkata)

TL;DR
The paper presents HMC-PSO, a novel optimization algorithm combining Hamiltonian dynamics with particle swarm principles, enhancing exploration and exploitation especially for non-convex functions and neural network training.
Contribution
It introduces HMC-PSO, integrating Hamiltonian Monte Carlo with PSO, and extends it to approximate gradients for deep neural network optimization.
Findings
HMC-PSO effectively explores non-convex functions.
It outperforms some state-of-the-art optimizers on benchmark tasks.
The method provides a new approach to gradient approximation in DNNs.
Abstract
We introduce the Hamiltonian Monte Carlo Particle Swarm Optimizer (HMC-PSO), an optimization algorithm that reaps the benefits of both Exponentially Averaged Momentum PSO and HMC sampling. The coupling of the position and velocity of each particle with Hamiltonian dynamics in the simulation allows for extensive freedom for exploration and exploitation of the search space. It also provides an excellent technique to explore highly non-convex functions while ensuring efficient sampling. We extend the method to approximate error gradients in closed form for Deep Neural Network (DNN) settings. We discuss possible methods of coupling and compare its performance to that of state-of-the-art optimizers on the Golomb's Ruler problem and Classification tasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Stochastic Gradient Optimization Techniques
