LLM as a Complementary Optimizer to Gradient Descent: A Case Study in Prompt Tuning

Zixian Guo; Ming Liu; Zhilong Ji; Jinfeng Bai; Yiwen Guo; Wangmeng Zuo

arXiv:2405.19732 · cs.CV · December 5, 2024 · 1 citation

TL;DR

This paper proposes a novel collaborative optimization framework combining gradient descent and LLM-based inference, demonstrating improved prompt tuning performance through their complementary strengths.

Contribution

It introduces a new method that alternates between gradient-based and LLM-based optimization, leveraging their synergy for better prompt tuning results.

Findings

1. Combined optimization outperforms individual methods on various tasks.
2. LLMs effectively generate improved solutions based on gradient trajectories.
3. Synergistic approach yields consistent performance gains.

Abstract

Mastering a skill generally relies on both hands-on experience from doers and insightful, high-level guidance by mentors. Will this strategy also work well for solving complex non-convex optimization problems? Here, a common gradient-based optimizer acts like a disciplined doer, making locally optimal updates at each step. Large Language Models (LLMs) can also search for better solutions by inferring from natural language instructions, akin to a high-level mentor. In this paper, we show that these two participators are complementary to each other and can effectively collaborate as a combined optimization framework. The collaborative optimization is achieved by alternating between the gradient-based and LLM-based optimizers. We instruct LLMs to generate possibly improved solutions by taking parameter trajectories recorded during the previous stage of gradient-based optimization into…
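The alternation described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the one-dimensional objective, the greedy acceptance rule, and the `mock_llm_propose` function are all assumptions made for illustration. In particular, `mock_llm_propose` is a hypothetical stand-in that returns fixed perturbations of the last point; a real system would format the recorded trajectory into a prompt and parse candidates from the LLM's reply.

```python
from typing import Callable, List


def loss(x: float) -> float:
    # Toy non-convex objective with two basins (an assumption for illustration).
    return x**4 - 3 * x**2 + x


def grad(x: float) -> float:
    # Analytic derivative of the toy objective.
    return 4 * x**3 - 6 * x + 1


def gradient_stage(x: float, steps: int, lr: float) -> List[float]:
    """Run plain gradient descent and return the recorded parameter trajectory."""
    trajectory = [x]
    for _ in range(steps):
        x = x - lr * grad(x)
        trajectory.append(x)
    return trajectory


def mock_llm_propose(trajectory: List[float]) -> List[float]:
    """Hypothetical stand-in for the LLM-based optimizer.

    It ignores everything except the last point and returns fixed
    perturbations around it; no language model is called.
    """
    last = trajectory[-1]
    return [last + delta for delta in (-2.0, -1.0, 1.0, 2.0)]


def collaborative_optimize(
    x0: float,
    rounds: int = 3,
    steps: int = 50,
    lr: float = 0.01,
    propose: Callable[[List[float]], List[float]] = mock_llm_propose,
) -> float:
    """Alternate a gradient stage with a proposal stage for a few rounds."""
    x = x0
    for _ in range(rounds):
        # Gradient stage: local updates, with the trajectory recorded.
        trajectory = gradient_stage(x, steps, lr)
        x = trajectory[-1]
        # Proposal stage: candidates are generated from the trajectory.
        candidates = propose(trajectory)
        # Greedy acceptance (an assumption): keep the lowest-loss point,
        # which becomes the restart point for the next gradient stage.
        x = min(candidates + [x], key=loss)
    # Final gradient stage to refine the last accepted point.
    return gradient_stage(x, steps, lr)[-1]


if __name__ == "__main__":
    x_gd = gradient_stage(1.0, 200, 0.01)[-1]
    x_joint = collaborative_optimize(1.0)
    print(f"gradient descent only: x={x_gd:.3f}, loss={loss(x_gd):.3f}")
    print(f"alternating scheme:    x={x_joint:.3f}, loss={loss(x_joint):.3f}")
```

Starting from `x0 = 1.0` on this toy objective, gradient descent alone settles in the shallower basin near x ≈ 1.13, while the alternating loop escapes to the deeper basin near x ≈ -1.30 because one of the proposed candidates lands there. The point of the sketch is only the control flow: a gradient stage that records its trajectory, followed by a proposal stage whose accepted output restarts the next gradient stage.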

Code & Models

Repositories

guozix/llm-catalyst (PyTorch, official)

Taxonomy

Topics: Manufacturing Process and Optimization