Controlled Online Optimization Learning (COOL): Finding the ground state of spin Hamiltonians with reinforcement learning
Kyle Mills, Pooya Ronagh, Isaac Tamblyn

TL;DR
This paper introduces a reinforcement learning approach to optimize simulated annealing for finding ground states of spin Hamiltonians, outperforming traditional schedules and generalizing across Hamiltonian classes.
Contribution
The paper presents a novel RL-based method that learns adaptive temperature schedules for annealing, surpassing heuristic methods and demonstrating scalability and robustness.
Findings
RL-driven annealing outperforms standard schedules
The method generalizes to unseen Hamiltonians within a class
Achieves an order of magnitude improvement in scaling performance
Abstract
Reinforcement learning (RL) has become a proven method for optimizing a procedure for which success has been defined, but the specific actions needed to achieve it have not. We apply the so-called "black box" method of RL to what has been referred as the "black art" of simulated annealing (SA), demonstrating that an RL agent based on proximal policy optimization can, through experience alone, arrive at a temperature schedule that surpasses the performance of standard heuristic temperature schedules for two classes of Hamiltonians. When the system is initialized at a cool temperature, the RL agent learns to heat the system to "melt" it, and then slowly cool it in an effort to anneal to the ground state; if the system is initialized at a high temperature, the algorithm immediately cools the system. We investigate the performance of our RL-driven SA agent in generalizing to all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Quantum Computing Algorithms and Architecture · Reinforcement Learning in Robotics
