Distributed optimization of deeply nested systems
Miguel \'A. Carreira-Perpi\~n\'an, Weiran Wang

TL;DR
This paper introduces the method of auxiliary coordinates (MAC), a novel approach for efficiently training deeply nested systems by transforming the complex optimization problem into a constrained one, enabling parallelization and improved convergence.
Contribution
The paper presents MAC, a general mathematical strategy that simplifies the optimization of nested systems, allowing for parallel computation and applicability even without parameter derivatives.
Findings
MAC converges provably and efficiently.
Enables parallel and distributed optimization of nested systems.
Performs competitively with state-of-the-art optimizers.
Abstract
In science and engineering, intelligent processing of complex signals such as images, sound or language is often performed by a parameterized hierarchy of nonlinear processing layers, sometimes biologically inspired. Hierarchical systems (or, more generally, nested systems) offer a way to generate complex mappings using simple stages. Each layer performs a different operation and achieves an ever more sophisticated representation of the input, as, for example, in an deep artificial neural network, an object recognition cascade in computer vision or a speech front-end processing. Joint estimation of the parameters of all the layers and selection of an optimal architecture is widely considered to be a difficult numerical nonconvex optimization problem, difficult to parallelize for execution in a distributed computation environment, and requiring significant human expert effort, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Neural Networks and Applications · Blind Source Separation Techniques
