Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb; Adam Davies; Alasdair Paren; Philip H.S. Torr; Francesco Pinto

arXiv:2410.22944·cs.LG·June 6, 2025

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb, Adam Davies, Alasdair Paren, Philip H.S. Torr, Francesco Pinto

PDF

Open Access 1 Video

TL;DR

This paper introduces Focus Instruction Tuning (FIT), a method to train large language models to focus on specific features for more robust, fair, and controllable responses, addressing biases and improving generalization.

Contribution

The paper presents FIT, a novel training approach that enables LLMs to intrinsically focus on specified features, enhancing robustness, fairness, and controllability compared to existing post-hoc methods.

Findings

01

Successfully steers model behavior at inference time

02

Increases robustness by emphasizing core task signals

03

Reduces social bias by suppressing demographic attributes

Abstract

Despite the success of Instruction Tuning (IT) in training large language models (LLMs), such models often leverage spurious or biased features learnt from their training data and can become misaligned, leading to undesired behaviours. While existing techniques can steer model behaviour at inference-time, they are often post-hoc and do not embed steering as an intrinsic model feature. In this work, we introduce Focus Instruction Tuning (FIT), which trains LLMs to condition their responses by focusing on specific features whilst ignoring others, leading to different behaviours based on what features are specified. Across diverse benchmarks, we demonstrate that FIT: (i) successfully steers behaviour at inference time; (ii) increases robustness by amplifying core task signals and down-weighting spurious cues; (iii) mitigates social bias by suppressing demographic attributes; and (iv)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification· slideslive

Taxonomy

TopicsDigital Rights Management and Security · Business Process Modeling and Analysis · Semantic Web and Ontologies

MethodsFocus