Reinforcement-learning-based Algorithms for Optimization Problems and Applications to Inverse Problems
Chen Xu, Yun-Bin Zhao, Zhipeng Lu, Ye Zhang

TL;DR
This paper introduces REINFORCE-OPT, a reinforcement learning-based iterative algorithm for optimization and inverse problems, demonstrating superior performance and robustness over traditional methods, with applications to nonlinear inverse problems and uncertainty quantification.
Contribution
The paper presents a novel RL-based optimization algorithm, REINFORCE-OPT, and establishes its theoretical convergence, practical advantages, and applicability to inverse problems with uncertainty quantification.
Findings
REINFORCE-OPT outperforms gradient descent, genetic algorithms, and particle swarm optimization.
The method effectively escapes local optima and is robust to initial conditions.
It can quantify uncertainty and identify multiple solutions in ill-posed inverse problems.
Abstract
We design a new iterative algorithm, called REINFORCE-OPT, for solving a general type of optimization problems. This algorithm parameterizes the solution search rule and iteratively updates the parameter using a reinforcement learning (RL) algorithm resembling REINFORCE. To gain a deeper understanding of the RL-based methods, we show that REINFORCE-OPT essentially solves a stochastic version of the given optimization problem, and that under standard assumptions, the searching rule parameter almost surely converges to a locally optimal value. Experiments show that REINFORCE-OPT outperforms other optimization methods such as gradient descent, the genetic algorithm, and particle swarm optimization, via its ability to escape from locally optimal solutions and its robustness to the choice of initial values. With rigorous derivations, we formally introduce the use of reinforcement learning to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical methods in inverse problems · Statistical and numerical algorithms · Control Systems and Identification
