SLOT: Sample-specific Language Model Optimization at Test-time
Yang Hu, Xingyu Zhang, Xueji Fang, Zhiyang Chen, Xiao Wang, Huatian Zhang, Guojun Qi

TL;DR
SLOT is a test-time optimization method that adapts large language models to individual prompts by updating a lightweight parameter vector, significantly improving their accuracy on complex tasks.
Contribution
This paper introduces SLOT, a parameter-efficient test-time adaptation technique that enhances LLM performance on specific prompts through few optimization steps.
Findings
Outperforms baseline models on multiple benchmarks.
Qwen2.5-7B with SLOT improves GSM8K accuracy by 8.6%.
DeepSeek-R1-Distill-Llama-70B with SLOT achieves SOTA on GPQA.
Abstract
We propose SLOT (Sample-specific Language Model Optimization at Test-time), a novel and parameter-efficient test-time inference approach that enhances a language model's ability to more accurately respond to individual prompts. Existing Large Language Models (LLMs) often struggle with complex instructions, leading to poor performances on those not well represented among general samples. To address this, SLOT conducts few optimization steps at test-time to update a light-weight sample-specific parameter vector. It is added to the final hidden layer before the output head, and enables efficient adaptation by caching the last layer features during per-sample optimization. By minimizing the cross-entropy loss on the input prompt only, SLOT helps the model better aligned with and follow each given instruction. In experiments, we demonstrate that our method outperforms the compared models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification
