LLM as a Complementary Optimizer to Gradient Descent: A Case Study in Prompt Tuning
Zixian Guo, Ming Liu, Zhilong Ji, Jinfeng Bai, Yiwen Guo, Wangmeng Zuo

TL;DR
This paper proposes a novel collaborative optimization framework combining gradient descent and LLM-based inference, demonstrating improved prompt tuning performance through their complementary strengths.
Contribution
It introduces a new method that alternates between gradient-based and LLM-based optimization, leveraging their synergy for better prompt tuning results.
Findings
Combined optimization outperforms individual methods on various tasks.
LLMs effectively generate improved solutions based on gradient trajectories.
Synergistic approach yields consistent performance gains.
Abstract
Mastering a skill generally relies on both hands-on experience from doers and insightful, high-level guidance by mentors. Will this strategy also work well for solving complex non-convex optimization problems? Here, a common gradient-based optimizer acts like a disciplined doer, making locally optimal updates at each step. Large Language Models (LLMs) can also search for better solutions by inferring from natural language instructions, akin to a high-level mentor. In this paper, we show that these two participators are complementary to each other and can effectively collaborate as a combined optimization framework. The collaborative optimization is achieved by alternating between the gradient-based and LLM-based optimizers. We instruct LLMs to generate possibly improved solutions by taking parameter trajectories recorded during the previous stage of gradient-based optimization into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsManufacturing Process and Optimization
