Optimizer Amalgamation
Tianshu Huang, Tianlong Chen, Sijia Liu, Shiyu Chang, Lisa Amini,, Zhangyang Wang

TL;DR
This paper introduces a novel approach called Optimizer Amalgamation, which combines multiple optimizers into a single learnable optimizer using differentiable mechanisms and variance reduction techniques, demonstrating improved performance over individual optimizers.
Contribution
It proposes a new method to amalgamate multiple optimizers into one using gradient-based learning and variance reduction, advancing the field of learning to optimize.
Findings
Amalgamated optimizer outperforms individual optimizers.
Variance reduction improves amalgamation stability.
The approach surpasses existing learning to optimize baselines.
Abstract
Selecting an appropriate optimizer for a given problem is of major interest for researchers and practitioners. Many analytical optimizers have been proposed using a variety of theoretical and empirical approaches; however, none can offer a universal advantage over other competitive optimizers. We are thus motivated to study a new problem named Optimizer Amalgamation: how can we best combine a pool of "teacher" optimizers into a single "student" optimizer that can have stronger problem-specific performance? In this paper, we draw inspiration from the field of "learning to optimize" to use a learnable amalgamation target. First, we define three differentiable amalgamation mechanisms to amalgamate a pool of analytical optimizers by gradient descent. Then, in order to reduce variance of the amalgamation process, we also explore methods to stabilize the amalgamation process by perturbing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and ELM · Metaheuristic Optimization Algorithms Research · Advanced Bandit Algorithms Research
