Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training

Reza Shirkavand; Peiran Yu; Qi He; Heng Huang

arXiv:2502.03604·cs.LG·December 16, 2025

Bilevel ZOFO: Efficient LLM Fine-Tuning and Meta-Training

Reza Shirkavand, Peiran Yu, Qi He, Heng Huang

PDF

Open Access

TL;DR

Bilevel-ZOFO introduces a bilevel optimization approach combining first-order PEFT and zeroth-order updates to efficiently fine-tune large language models with improved convergence, stability, and generalization.

Contribution

This paper proposes Bilevel-ZOFO, a novel bilevel optimization method that integrates FO-PEFT and ZO techniques for efficient, stable, and generalizable LLM fine-tuning and meta-training.

Findings

01

Achieves 2-4 times faster training than existing methods.

02

Maintains similar memory efficiency while improving convergence.

03

Effectively combines full-model capacity with few-shot adaptation.

Abstract

Fine-tuning pre-trained Large Language Models (LLMs) for downstream tasks using First-Order (FO) optimizers presents significant computational challenges. Parameter-Efficient Fine-Tuning (PEFT) methods address these by freezing most model parameters and training only a small subset. However, PEFT often underperforms compared to full fine-tuning when high task-specific accuracy is required. Zeroth-Order (ZO) methods fine-tune the entire pre-trained model without back-propagation, estimating gradients through forward passes only. While memory-efficient, ZO methods suffer from slow convergence and high sensitivity to prompt selection. We bridge these two worlds with Bilevel-ZOFO, a bilevel optimization method that couples fast, local FO-PEFT adaptation at the inner level with stable, memory-efficient ZO updates of the full backbone at the outer level. The FO-PEFT inner loop performs fast,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMagnetic confinement fusion research

MethodsBalanced Selection