SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models
Juan Pablo Mu\~noz, Jinjie Yuan, Nilesh Jain

TL;DR
SQFT introduces a low-cost, sparse, and low-precision fine-tuning method for large pre-trained models, enabling efficient adaptation in resource-limited settings while maintaining accuracy.
Contribution
The paper presents SQFT, a novel end-to-end approach for sparse, low-precision fine-tuning and merging of weights and adapters without accuracy loss.
Findings
Effective across multiple models and sparsity levels
Maintains accuracy with low-precision, sparse fine-tuning
Enables resource-efficient model adaptation
Abstract
Large pre-trained models (LPMs), such as large language models, have become ubiquitous and are employed in many applications. These models are often adapted to a desired domain or downstream task through a fine-tuning stage. This paper proposes SQFT, an end-to-end solution for low-precision sparse parameter-efficient fine-tuning of LPMs, allowing for effective model manipulation in resource-constrained environments. Additionally, an innovative strategy enables the merging of sparse weights with low-rank adapters without losing sparsity and accuracy, overcoming the limitations of previous approaches. SQFT also addresses the challenge of having quantized weights and adapters with different numerical precisions, enabling merging in the desired numerical format without sacrificing accuracy. Multiple adaptation scenarios, models, and comprehensive sparsity levels demonstrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗IntelLabs/sqft-phi-3-mini-4k-50-basemodel· 775 dl· ♡ 2775 dl♡ 2
- 🤗IntelLabs/sqft-phi-3-mini-4k-60-basemodel· 19 dl· ♡ 219 dl♡ 2
- 🤗IntelLabs/sqft-phi-3-mini-4k-30-basemodel· 14 dl· ♡ 214 dl♡ 2
- 🤗IntelLabs/sqft-phi-3-mini-4k-40-basemodel· 12 dl· ♡ 212 dl♡ 2
- 🤗IntelLabs/sqft-phi-3-mini-4k-50-base-gptqmodel· 767 dl· ♡ 2767 dl♡ 2
- 🤗IntelLabs/sqft-mistral-7b-v0.3-50-basemodel· 713 dl· ♡ 2713 dl♡ 2
- 🤗IntelLabs/sqft-mistral-7b-v0.3-50-base-gptqmodel· 698 dl· ♡ 2698 dl♡ 2
- 🤗IntelLabs/sqft-sparsepeft-mistral-7b-v0.3-50-gsm8k-heumodel· 6 dl· ♡ 26 dl♡ 2
- 🤗IntelLabs/sqft-qa-sparsepeft-mistral-7b-v0.3-50-gptq-gsm8k-heumodel· 6 dl· ♡ 26 dl♡ 2
- 🤗IntelLabs/sqft-mistral-7b-v0.3-50-gptq-math-heu-adaptermodel· 4 dl· ♡ 24 dl♡ 2
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSeismic Imaging and Inversion Techniques · Reservoir Engineering and Simulation Methods · Drilling and Well Engineering
