SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models

Juan Pablo Muñoz; Jinjie Yuan; Nilesh Jain

arXiv:2410.03750·cs.LG·October 8, 2024

TL;DR

SQFT introduces a low-cost, sparse, and low-precision fine-tuning method for large pre-trained models, enabling efficient adaptation in resource-limited settings while maintaining accuracy.

Contribution

The paper presents SQFT, an end-to-end approach for sparse, low-precision parameter-efficient fine-tuning that merges sparse or quantized weights with low-rank adapters without losing sparsity or accuracy.

Findings

01 Effective across multiple models and sparsity levels

02 Maintains accuracy with low-precision, sparse fine-tuning

03 Enables resource-efficient model adaptation

Abstract

Large pre-trained models (LPMs), such as large language models, have become ubiquitous and are employed in many applications. These models are often adapted to a desired domain or downstream task through a fine-tuning stage. This paper proposes SQFT, an end-to-end solution for low-precision sparse parameter-efficient fine-tuning of LPMs, allowing for effective model manipulation in resource-constrained environments. Additionally, an innovative strategy enables the merging of sparse weights with low-rank adapters without losing sparsity and accuracy, overcoming the limitations of previous approaches. SQFT also addresses the challenge of having quantized weights and adapters with different numerical precisions, enabling merging in the desired numerical format without sacrificing accuracy. Multiple adaptation scenarios, models, and comprehensive sparsity levels demonstrate the…
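The abstract's point about merging low-rank adapters into sparse weights "without losing sparsity" can be illustrated with a toy example. The sketch below is an assumption about one way such a merge could work, not the paper's implementation: it masks the adapter update with the sparsity pattern of the base weight, so pruned entries stay zero. The helper names (`merge_dense`, `merge_masked`, `sparsity_mask`) and the toy matrices are hypothetical, and quantization is not modeled.

```python
import random

def matmul(left, right):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*right)]
            for row in left]

def sparsity_mask(w):
    """1 where a weight is nonzero, 0 where it has been pruned."""
    return [[1 if v != 0 else 0 for v in row] for row in w]

def sparsity(w):
    """Fraction of entries that are exactly zero."""
    total = sum(len(row) for row in w)
    zeros = sum(1 for row in w for v in row if v == 0)
    return zeros / total

def merge_dense(w, b, a):
    """Plain low-rank merge, W + B @ A. The dense update fills pruned entries."""
    delta = matmul(b, a)
    return [[wv + dv for wv, dv in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]

def merge_masked(w, b, a):
    """Assumed sparsity-preserving merge, W + (B @ A) * M, with M the mask of W."""
    delta = matmul(b, a)
    mask = sparsity_mask(w)
    return [[wv + dv * mv for wv, dv, mv in zip(w_row, d_row, m_row)]
            for w_row, d_row, m_row in zip(w, delta, mask)]

# Toy data (hypothetical): a 4x6 weight with a checkerboard pruning pattern
# and a rank-2 adapter with small random entries.
random.seed(0)
rows, cols, rank = 4, 6, 2
w = [[random.uniform(-1, 1) if (i + j) % 2 == 0 else 0.0 for j in range(cols)]
     for i in range(rows)]
b = [[random.uniform(-0.1, 0.1) for _ in range(rank)] for _ in range(rows)]
a = [[random.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rank)]

print("base sparsity:  ", sparsity(w))
print("dense merge:    ", sparsity(merge_dense(w, b, a)))
print("masked merge:   ", sparsity(merge_masked(w, b, a)))
```

In this toy setting the dense merge overwrites the pruned zeros, while the masked merge keeps the pruning pattern of the base weight. For the merged model to match the fine-tuned one, the same mask would presumably also have to be applied to the adapter during training.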

Code & Models

Repositories

intellabs/hardware-aware-automated-machine-learning (PyTorch, official)
