Learning an Efficient Optimizer via Hybrid-Policy Sub-Trajectory Balance

Yunchuan Guan; Yu Liu; Ke Zhou; Hui Li; Sen Jia; Zhiqi Shen; Ziyang Wang; Xinglin Zhang; Tao Chen; Jenq-Neng Hwang; Lei Li

arXiv:2511.00543·cs.LG·November 4, 2025

Learning an Efficient Optimizer via Hybrid-Policy Sub-Trajectory Balance

Yunchuan Guan, Yu Liu, Ke Zhou, Hui Li, Sen Jia, Zhiqi Shen, Ziyang Wang, Xinglin Zhang, Tao Chen, Jenq-Neng Hwang, Lei Li

PDF

Open Access

TL;DR

This paper introduces Lo-Hp, a two-stage weight generation framework that improves optimizer flexibility and efficiency by combining on-policy and off-policy learning, addressing long-horizon and over-coupling issues in neural network weight generation.

Contribution

It proposes a hybrid-policy sub-trajectory balance objective for decoupled weight generation, enabling better local and global optimization policies.

Findings

01

Achieves higher accuracy in transfer and few-shot learning tasks.

02

Demonstrates improved inference efficiency over existing methods.

03

Addresses long-horizon issues in weight optimization.

Abstract

Recent advances in generative modeling enable neural networks to generate weights without relying on gradient-based optimization. However, current methods are limited by issues of over-coupling and long-horizon. The former tightly binds weight generation with task-specific objectives, thereby limiting the flexibility of the learned optimizer. The latter leads to inefficiency and low accuracy during inference, caused by the lack of local constraints. In this paper, we propose Lo-Hp, a decoupled two-stage weight generation framework that enhances flexibility through learning various optimization policies. It adopts a hybrid-policy sub-trajectory balance objective, which integrates on-policy and off-policy learning to capture local optimization policies. Theoretically, we demonstrate that learning solely local optimization policies can address the long-horizon issue while enhancing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications