CLaS-Bench: A Cross-Lingual Alignment and Steering Benchmark

Daniil Gurgurov; Yusser Al Ghussin; Tanja Baeumel; Cheng-Ting Chou; Patrick Schramowski; Marius Mosbach; Josef van Genabith; Simon Ostermann

arXiv:2601.08331·cs.CL·January 14, 2026

CLaS-Bench: A Cross-Lingual Alignment and Steering Benchmark

Daniil Gurgurov, Yusser Al Ghussin, Tanja Baeumel, Cheng-Ting Chou, Patrick Schramowski, Marius Mosbach, Josef van Genabith, Simon Ostermann

PDF

Open Access 1 Datasets

TL;DR

CLaS-Bench is a new benchmark for evaluating multilingual steering techniques in large language models, enabling systematic assessment across 32 languages and revealing insights into language-specific structures and effective steering methods.

Contribution

It introduces CLaS-Bench, the first standardized benchmark for multilingual steering, and provides comprehensive evaluation of various steering techniques across multiple languages.

Findings

01

Residual-based DiffMean method outperforms others

02

Language-specific structures emerge in later layers

03

Steering directions cluster by language family

Abstract

Understanding and controlling the behavior of large language models (LLMs) is an increasingly important topic in multilingual NLP. Beyond prompting or fine-tuning, , i.e.,~manipulating internal representations during inference, has emerged as a more efficient and interpretable technique for adapting models to a target language. Yet, no dedicated benchmarks or evaluation protocols exist to quantify the effectiveness of steering techniques. We introduce CLaS-Bench, a lightweight parallel-question benchmark for evaluating language-forcing behavior in LLMs across 32 languages, enabling systematic evaluation of multilingual steering methods. We evaluate a broad array of steering techniques, including residual-stream DiffMean interventions, probe-derived directions, language-specific neurons, PCA/LDA vectors, Sparse Autoencoders, and prompting baselines. Steering performance is measured along…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

DGurgurov/CLaS-Bench
dataset· 42 dl
42 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Multimodal Machine Learning Applications