TL;DR
The paper introduces One-for-All, a parameter-efficient, stable, and lightweight LLM adaptation method for time series forecasting, achieving state-of-the-art efficiency-accuracy trade-offs and enabling edge deployment.
Contribution
It proposes rsLoRA, a mathematically grounded rank-stabilization mechanism for low-rank adapters, reducing parameters and memory while maintaining accuracy.
Findings
Achieves 5.5× higher parameter efficiency than TimesNet.
Matches forecasting accuracy of larger models like GPT4TS.
Reduces memory footprint by up to 1776×.
Abstract
We address the challenge of adapting pre-trained Large Language Models (LLMs) for multivariate time-series analysis, where their deployment is often hindered by prohibitive computational and memory demands. Our solution, One-for-All, introduces Gaussian Rank-Stabilized Low-Rank Adapters (rsLoRA) to enable parameter-efficient fine-tuning of frozen LLMs. While inspired by LoRA, rsLoRA introduces a mathematically grounded rank-stabilization mechanism that enables provable gradient stability at low ranks a novel contribution absent in prior PEFT methods. Our framework injects trainable rank decomposition matrices (rank 16) into positional embeddings and output layers, while keeping self-attention weights fixed. This design reduces trainable parameters by 6.8 (vs. TimesNet), 21 (vs. GPT4TS), and 11.8 (vs. TIME-LLM), while achieving a 168-1,776 smaller memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
