Training Stronger Baselines for Learning to Optimize

Tianlong Chen; Weiyi Zhang; Jingyang Zhou; Shiyu Chang; Sijia Liu,; Lisa Amini; Zhangyang Wang

arXiv:2010.09089·cs.LG·October 20, 2020·6 cites

Training Stronger Baselines for Learning to Optimize

Tianlong Chen, Weiyi Zhang, Jingyang Zhou, Shiyu Chang, Sijia Liu,, Lisa Amini, Zhangyang Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces improved training techniques for learning to optimize models, significantly enhancing their performance and stability without altering their architectures, and demonstrating that even simple models can outperform complex ones.

Contribution

The paper proposes a progressive training scheme and off-policy imitation learning to better train L2O models, bridging the gap between their potential and current performance.

Findings

01

Enhanced training techniques boost L2O model performance.

02

Simple L2O models can outperform complex ones with better training.

03

Training improvements are model-agnostic and easy to implement.

Abstract

Learning to optimize (L2O) has gained increasing attention since classical optimizers require laborious problem-specific design and hyperparameter tuning. However, there is a gap between the practical demand and the achievable performance of existing L2O models. Specifically, those learned optimizers are applicable to only a limited class of problems, and often exhibit instability. With many efforts devoted to designing more sophisticated L2O models, we argue for another orthogonal, under-explored theme: the training techniques for those L2O models. We show that even the simplest L2O model could have been trained much better. We first present a progressive training scheme to gradually increase the optimizer unroll length, to mitigate a well-known L2O dilemma of truncation bias (shorter unrolling) versus gradient explosion (longer unrolling). We further leverage off-policy imitation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

VITA-Group/L2O-Training-Techniques
tfOfficial

Videos

Training Stronger Baselines for Learning to Optimize· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Advanced Neural Network Applications