Improving Multilingual Language Models by Aligning Representations through Steering

Omar Mahmoud; Buddhika Laknath Semage; Thommen George Karimpanal; and Santu Rana

arXiv:2505.12584·cs.CL·August 27, 2025

Improving Multilingual Language Models by Aligning Representations through Steering

Omar Mahmoud, Buddhika Laknath Semage, Thommen George Karimpanal, and Santu Rana

PDF

Open Access

TL;DR

This paper introduces a lightweight representation steering method that improves multilingual performance of large language models by aligning internal representations, achieving results comparable to translation systems with fewer resources.

Contribution

The paper presents a novel, efficient intervention technique using representation steering to enhance multilingual capabilities of LLMs, outperforming many existing methods.

Findings

01

Outperforms most baseline methods in multilingual tasks

02

Achieves translation-quality performance with fewer resources

03

Complementary to supervised fine-tuning, enhancing internal representations

Abstract

This paper investigates how Large Language Models (LLMs) represent non-English tokens -- a question that remains underexplored despite recent progress. We propose a lightweight intervention method using representation steering, where a learned vector is added to the residual stream at a single model layer to enhance multilingual performance. Through extensive experiments across seven competitive baselines -- including prompt optimization, supervised fine-tuning (SFT), in-context learning, cross-lingual transfer, and translation-based methods-we show that our approach consistently outperforms most alternatives. In particular, it achieves performance on par with production-grade translation systems while requiring far fewer resources. We further explore the complementarity between our method and SFT, demonstrating that steering offers a direct, efficient way to realign internal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Generative Adversarial Networks and Image Synthesis

MethodsALIGN