Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training
Reza Shirkavand, Peiran Yu, Qi He, Heng Huang

TL;DR
Bilevel-ZOFO introduces a bilevel optimization approach combining first-order PEFT and zeroth-order updates to efficiently fine-tune large language models with improved convergence, stability, and generalization.
Contribution
This paper proposes Bilevel-ZOFO, a novel bilevel optimization method that integrates FO-PEFT and ZO techniques for efficient, stable, and generalizable LLM fine-tuning and meta-training.
Findings
Achieves 2-4 times faster training than existing methods.
Maintains similar memory efficiency while improving convergence.
Effectively combines full-model capacity with few-shot adaptation.
Abstract
Fine-tuning pre-trained Large Language Models (LLMs) for downstream tasks using First-Order (FO) optimizers presents significant computational challenges. Parameter-Efficient Fine-Tuning (PEFT) methods address these by freezing most model parameters and training only a small subset. However, PEFT often underperforms compared to full fine-tuning when high task-specific accuracy is required. Zeroth-Order (ZO) methods fine-tune the entire pre-trained model without back-propagation, estimating gradients through forward passes only. While memory-efficient, ZO methods suffer from slow convergence and high sensitivity to prompt selection. We bridge these two worlds with Bilevel-ZOFO, a bilevel optimization method that couples fast, local FO-PEFT adaptation at the inner level with stable, memory-efficient ZO updates of the full backbone at the outer level. The FO-PEFT inner loop performs fast,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMagnetic confinement fusion research
MethodsBalanced Selection
