Designing Role Vectors to Improve LLM Inference Behaviour

Daniele Potertì; Andrea Seveso; Fabio Mercorio

arXiv:2502.12055 · cs.CL · February 18, 2025

Open Access

TL;DR

This paper introduces role vectors derived from model activations as a novel method to steer LLM behavior, demonstrating their effectiveness in improving domain-specific performance over traditional persona prompts.

Contribution

The study presents a new approach using role vectors to influence LLM behavior, showing they outperform persona-based prompting in guiding models toward domain expertise.

Findings

01. Role vectors influence model behavior and improve task performance.

02. Activation addition reinforces role-specific directions.

03. Directional ablation removes role-specific directions, which affects performance.

Abstract

The influence of personas on Large Language Models (LLMs) has been widely studied, yet their direct impact on performance remains uncertain. This work explores a novel approach to guiding LLM behaviour through role vectors, an alternative to persona-based prompting. We construct 29 role vectors derived from model activations and evaluate their impact on benchmark performance across multiple domains. Our analysis investigates whether these vectors can effectively steer models toward domain-specific expertise. We measure two key interventions: (i) activation addition, which reinforces role-specific directions, and (ii) directional ablation, which removes them. Results on well-established benchmarks indicate that role vectors do, in fact, influence model behaviour, improving task performance in relevant domains while marginally affecting unrelated tasks. This, in turn, suggests that…
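
The two interventions named in the abstract can be illustrated with a minimal sketch on a single activation vector. This is not the authors' implementation: it uses the formulation commonly seen in activation-steering work, where addition shifts an activation along a direction and ablation projects that direction out. The scaling factor `alpha`, the helper names, and the toy vectors are all hypothetical; the layers, scaling, and normalization used in the paper are not stated on this page.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def unit(v):
    """Return v scaled to unit length."""
    norm = math.sqrt(dot(v, v))
    if norm == 0.0:
        raise ValueError("role vector must be non-zero")
    return [a / norm for a in v]


def activation_addition(h, role_vector, alpha=1.0):
    """Reinforce the role direction: h + alpha * r (alpha is an assumed knob)."""
    return [a + alpha * b for a, b in zip(h, role_vector)]


def directional_ablation(h, role_vector):
    """Remove the role direction: h - (h . r_hat) * r_hat."""
    r_hat = unit(role_vector)
    coeff = dot(h, r_hat)
    return [a - coeff * b for a, b in zip(h, r_hat)]


# Toy activation and toy role vector (illustrative values only).
h = [1.0, 2.0, 3.0]
r = [0.0, 0.0, 2.0]

steered = activation_addition(h, r, alpha=0.5)
ablated = directional_ablation(h, r)
```

After ablation, the activation has no remaining component along the role direction, while addition increases that component in proportion to `alpha`.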

Taxonomy

Topics: Semantic Web and Ontologies · Natural Language Processing Techniques