Spectrum: Targeted Training on Signal to Noise Ratio
Eric Hartford, Lucas Atkins, Fernando Fernandes Neto, David, Golchinfar

TL;DR
Spectrum is a novel training method for large language models that selectively fine-tunes modules based on their signal-to-noise ratio, reducing computational resources while maintaining performance.
Contribution
It introduces a SNR-based module selection algorithm for targeted training, enabling efficient fine-tuning with less memory and comparable results to full fine-tuning.
Findings
Spectrum matches full fine-tuning performance
Reduces GPU memory usage significantly
Outperforms existing methods like QLoRA in efficiency
Abstract
Efficiently post-training large language models remains a challenging task due to the vast computational resources required. We present Spectrum, a method that accelerates LLM training by selectively targeting layer modules based on their signal-to-noise ratio (SNR), and freezing the remaining modules. Our approach, which utilizes an algorithm to compute module SNRs prior to training, has shown to effectively match the performance of full fine-tuning while reducing GPU memory usage. Experiments comparing Spectrum to existing methods such as QLoRA demonstrate its effectiveness in terms of model quality and VRAM efficiency in distributed environments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗arcee-ai/Llama-3-SEC-Chatmodel· 6 dl· ♡ 376 dl♡ 37
- 🤗arcee-ai/Arcee-Agentmodel· 58 dl· ♡ 9358 dl♡ 93
- 🤗crusoeai/Arcee-Agent-GGUFmodel· 555 dl· ♡ 7555 dl♡ 7
- 🤗lucyknada/arcee-ai_Arcee-Agent-exl2-6.0bpwmodel· 1 dl1 dl
- 🤗QuantFactory/Arcee-Agent-GGUFmodel· 65 dl· ♡ 265 dl♡ 2
- 🤗akjindal53244/Llama-3.1-Storm-8Bmodel· 2.2k dl· ♡ 1772.2k dl♡ 177
- 🤗akjindal53244/Llama-3.1-Storm-8B-FP8-Dynamicmodel· 9 dl· ♡ 149 dl♡ 14
- 🤗akjindal53244/Llama-3.1-Storm-8B-GGUFmodel· 237 dl· ♡ 41237 dl♡ 41
- 🤗RichardErkhov/akjindal53244_-_Llama-3.1-Storm-8B-ggufmodel· 102 dl· ♡ 2102 dl♡ 2
- 🤗QuantFactory/Llama-3.1-Storm-8B-GGUFmodel· 37 dl· ♡ 237 dl♡ 2
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Sensor Technology and Measurement Systems
