Enhancing Cross-task Transfer of Large Language Models via Activation Steering

Xinyu Tang; Zhihao Lv; Xiaoxue Cheng; Junyi Li; Wayne Xin Zhao; Zujie Wen; Zhiqiang Zhang; Jun Zhou

arXiv:2507.13236·cs.CL·July 18, 2025

Enhancing Cross-task Transfer of Large Language Models via Activation Steering

Xinyu Tang, Zhihao Lv, Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao, Zujie Wen, Zhiqiang Zhang, Jun Zhou

PDF

Open Access

TL;DR

This paper introduces CAST, a novel method that improves cross-task transfer in large language models by steering internal activations, enhancing performance in low-resource and cross-lingual tasks without additional parameter updates.

Contribution

The paper proposes a new activation steering framework that leverages latent space manipulation for effective transfer across tasks without parameter fine-tuning.

Findings

01

Outperforms baseline methods in cross-domain transfer tasks

02

Demonstrates superior scalability and efficiency

03

Effective in low-resource and cross-lingual scenarios

Abstract

Large language models (LLMs) have shown impressive abilities in leveraging pretrained knowledge through prompting, but they often struggle with unseen tasks, particularly in data-scarce scenarios. While cross-task in-context learning offers a direct solution for transferring knowledge across tasks, it still faces critical challenges in terms of robustness, scalability, and efficiency. In this paper, we investigate whether cross-task transfer can be achieved via latent space steering without parameter updates or input expansion. Through an analysis of activation patterns in the latent space of LLMs, we observe that the enhanced activations induced by in-context examples have consistent patterns across different tasks. Inspired by these findings, we propose CAST, a novel Cross-task Activation Steering Transfer framework that enables effective transfer by manipulating the model's internal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Topic Modeling · Speech and dialogue systems