The Scaling Law for LoRA Base on Mutual Information Upper Bound

Jing Zhang; Hui Gao; Peng Zhang; Shuzhen Sun; Chang Yang; Yuexian Hou

arXiv:2501.03152·cs.LG·January 7, 2025

The Scaling Law for LoRA Base on Mutual Information Upper Bound

Jing Zhang, Hui Gao, Peng Zhang, Shuzhen Sun, Chang Yang, Yuexian Hou

PDF

Open Access

TL;DR

This paper introduces a mutual information upper bound metric to better understand and predict the performance scaling law of LoRA fine-tuning in large models, outperforming traditional external metrics.

Contribution

The paper proposes a novel internal metric based on MIUB theory to analyze the scaling law of LoRA fine-tuning, validated on large models and benchmark datasets.

Findings

01

MIUB metric aligns more accurately with the scaling law

02

Outperforms cross-entropy and perplexity in stability

03

Validated on Llama3-8B and Phi3-3B models

Abstract

LoRA (Low-Rank Adaptation) is a widely used model fine-tuning method. In fine-tuning, the law among model performance, model parameters, and data complexity has been a focal issue in the field. Existing methods often leverage external metrics (such as cross-entropy or perplexity) to evaluate model performance. In the fine-tuning process for large models, two types of knowledge are typically involved: the frozen, general knowledge acquired by the model during pre-training and the new knowledge learned through the LoRA module from the current data. Generally, the less LoRA's learned knowledge relies on the large model, the more it captures the specific knowledge of new data, thereby enhancing its adaptability to new tasks. However, external metrics do not readily capture the dependency relationship between these two types of knowledge. Therefore, we designed an internal metric based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCognitive Computing and Networks · Distributed systems and fault tolerance · Advanced Database Systems and Queries