Adaptive Principal Components Allocation with the   $\ell_{2,g}$-regularized Gaussian Graphical Model for Efficient Fine-Tuning   Large Models

Jingjing Zheng; Yankai Cao

arXiv:2412.08592·cs.LG·December 12, 2024

Adaptive Principal Components Allocation with the $\ell_{2,g}$-regularized Gaussian Graphical Model for Efficient Fine-Tuning Large Models

Jingjing Zheng, Yankai Cao

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel Gaussian Graphical Model-based parameter-efficient fine-tuning method for large models, effectively selecting critical parameters with fewer trainable parameters, demonstrated on the GLUE benchmark.

Contribution

It is the first to apply Gaussian Graphical Models to parameter-efficient fine-tuning, utilizing the $ ext{l}_{2,g}$-norm for parameter selection and global dependency capture.

Findings

01

Achieves competitive performance with fewer trainable parameters.

02

Demonstrates effectiveness on the GLUE benchmark.

03

Uses an efficient Block Coordinate Descent algorithm.

Abstract

In this work, we propose a novel Parameter-Efficient Fine-Tuning (PEFT) approach based on Gaussian Graphical Models (GGMs), marking the first application of GGMs to PEFT tasks, to the best of our knowledge. The proposed method utilizes the $ℓ_{2, g}$ -norm to effectively select critical parameters and capture global dependencies. The resulting non-convex optimization problem is efficiently solved using a Block Coordinate Descent (BCD) algorithm. Experimental results on the GLUE benchmark [24] for fine-tuning RoBERTa-Base [18] demonstrate the effectiveness of the proposed approach, achieving competitive performance with significantly fewer trainable parameters. The code for this work is available at: https://github.com/jzheng20/Course projects.git.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jzheng20/Course_projects
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeological Modeling and Analysis · Satellite Image Processing and Photogrammetry · Medical Image Segmentation Techniques