A Recursively Recurrent Neural Network (R2N2) Architecture for Learning Iterative Algorithms
Danimir T. Doncevic, Alexander Mitsos, Yue Guo, Qianxiao Li, Felix, Dietrich, Manuel Dahmen, Ioannis G. Kevrekidis

TL;DR
This paper introduces R2N2, a recursive neural network architecture designed to learn and adapt iterative algorithms for solving various computational problems, demonstrating similarities to classical solvers.
Contribution
The paper generalizes the Runge-Kutta neural network to a recursive superstructure that modularly learns iterative algorithms, bridging neural networks and classical numerical methods.
Findings
R2N2 learns Krylov-like solvers for linear systems
It develops Newton-Krylov methods for nonlinear problems
It mimics Runge-Kutta integrators for differential equations
Abstract
Meta-learning of numerical algorithms for a given task consists of the data-driven identification and adaptation of an algorithmic structure and the associated hyperparameters. To limit the complexity of the meta-learning problem, neural architectures with a certain inductive bias towards favorable algorithmic structures can, and should, be used. We generalize our previously introduced Runge-Kutta neural network to a recursively recurrent neural network (R2N2) superstructure for the design of customized iterative algorithms. In contrast to off-the-shelf deep learning approaches, it features a distinct division into modules for generation of information and for the subsequent assembly of this information towards a solution. Local information in the form of a subspace is generated by subordinate, inner, iterations of recurrent function evaluations starting at the current outer iterate.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks
