SALSA: Single-pass Autoregressive LLM Structured Classification

Ruslan Berdichevsky; Shai Nahum-Gefen; Elad Ben Zaken

arXiv:2510.22691·cs.CL·October 28, 2025

SALSA: Single-pass Autoregressive LLM Structured Classification

Ruslan Berdichevsky, Shai Nahum-Gefen, Elad Ben Zaken

PDF

TL;DR

SALSA is a novel classification method that uses structured prompting and class-to-token mapping to enable efficient, single-pass inference with large language models, achieving state-of-the-art results.

Contribution

It introduces a new pipeline combining structured prompting, class-to-token mapping, and parameter-efficient fine-tuning for LLM classification.

Findings

01

Achieves state-of-the-art results on multiple benchmarks.

02

Enables single-pass inference without cold-start training.

03

Demonstrates robustness and scalability in LLM classification.

Abstract

Despite their impressive generalization capabilities, instruction-tuned Large Language Models often underperform on text classification benchmarks. We introduce SALSA, a coherent pipeline that combines structured prompting, class-to-token mapping, and parameter-efficient fine-tuning, thereby avoiding cold-start training. Each class label is mapped to a distinct output token, and prompts are constructed to elicit a single-token response. During inference, the model's output is projected only onto the logits of the relevant class tokens, enabling efficient and accurate classification in a single forward pass. SALSA achieves state-of-the-art results across diverse benchmarks, demonstrating its robustness and scalability for LLM-based classification applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.