Reinforcement-learning-based Algorithms for Optimization Problems and Applications to Inverse Problems

Chen Xu; Yun-Bin Zhao; Zhipeng Lu; Ye Zhang

arXiv:2310.06711·math.OC·January 27, 2025

TL;DR

This paper introduces REINFORCE-OPT, a reinforcement-learning-based iterative algorithm for optimization problems. In the reported experiments it outperforms gradient descent, the genetic algorithm, and particle swarm optimization, and it is applied to nonlinear inverse problems, including uncertainty quantification.

Contribution

The paper presents a new RL-based optimization algorithm, REINFORCE-OPT, shows that under standard assumptions its search rule parameter converges almost surely to a locally optimal value, and applies it to inverse problems with uncertainty quantification.

Findings

01. In the reported experiments, REINFORCE-OPT outperforms gradient descent, the genetic algorithm, and particle swarm optimization.

02. The method can escape locally optimal solutions and is robust to the choice of initial values.

03. It can quantify uncertainty and identify multiple solutions in ill-posed inverse problems.

Abstract

We design a new iterative algorithm, called REINFORCE-OPT, for solving a general class of optimization problems. This algorithm parameterizes the solution search rule and iteratively updates the parameter using a reinforcement learning (RL) algorithm resembling REINFORCE. To gain a deeper understanding of RL-based methods, we show that REINFORCE-OPT essentially solves a stochastic version of the given optimization problem, and that under standard assumptions, the search rule parameter almost surely converges to a locally optimal value. Experiments show that REINFORCE-OPT outperforms other optimization methods such as gradient descent, the genetic algorithm, and particle swarm optimization, via its ability to escape from locally optimal solutions and its robustness to the choice of initial values. With rigorous derivations, we formally introduce the use of reinforcement learning to…
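
The idea of parameterizing a search rule and updating its parameter with a REINFORCE-style gradient can be illustrated with a minimal sketch. This is not the authors' REINFORCE-OPT: it is a simplified one-step variant in which the search rule is assumed to be a Gaussian N(mu, sigma^2) over candidate solutions, the reward is assumed to be the negative objective, and the names `objective`, `reinforce_search`, and all hyperparameters are hypothetical. It maximizes the stochastic surrogate E[-f(x)] over mu using the score-function (log-derivative) gradient estimate.

```python
import random


def objective(x):
    """Hypothetical test objective to minimize; its minimizer is x = 2."""
    return (x - 2.0) ** 2


def reinforce_search(f, mu0=-3.0, sigma=0.5, lr=0.05, batch=32, iters=500, seed=0):
    """Score-function (REINFORCE-style) ascent on E_{x ~ N(mu, sigma^2)}[-f(x)].

    Assumption: only the mean mu of the Gaussian search rule is learned;
    sigma is kept fixed for simplicity.
    """
    rng = random.Random(seed)
    mu = mu0
    for _ in range(iters):
        # Sample candidate solutions from the current search rule.
        xs = [rng.gauss(mu, sigma) for _ in range(batch)]
        # Reward is the negative objective, so higher reward means lower f.
        rewards = [-f(x) for x in xs]
        # Batch-mean baseline to reduce the variance of the estimate.
        baseline = sum(rewards) / batch
        # d/dmu log N(x; mu, sigma^2) = (x - mu) / sigma^2
        grad = sum(
            (r - baseline) * (x - mu) / sigma ** 2
            for r, x in zip(rewards, xs)
        ) / batch
        # Gradient ascent step on the expected reward.
        mu += lr * grad
    return mu


mu_final = reinforce_search(objective)
print(f"learned search mean: {mu_final:.3f}")
```

With the fixed seed the learned mean should settle close to the minimizer x = 2; the remaining fluctuation comes from the sampling noise in the gradient estimate. The update uses only objective values, never derivatives of f, which is one reason a sampling-based search rule can behave differently from gradient descent on nonconvex objectives.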

Taxonomy

Topics: Numerical methods in inverse problems · Statistical and numerical algorithms · Control Systems and Identification