ASA: Training-Free Representation Engineering for Tool-Calling Agents

Youjin Wang; Run Zhou; Rong Fu; Shuaishuai Cao; Hongwei Zeng; Jiaxuan Lu; Sicheng Fan; Jiaqiao Zhao; and Liangming Pan

arXiv:2602.04935·cs.SE·February 10, 2026

ASA: Training-Free Representation Engineering for Tool-Calling Agents

Youjin Wang, Run Zhou, Rong Fu, Shuaishuai Cao, Hongwei Zeng, Jiaxuan Lu, Sicheng Fan, Jiaqiao Zhao, and Liangming Pan

PDF

Open Access

TL;DR

This paper introduces ASA, a training-free method that improves tool-calling accuracy in language models by intervening at mid-layer activations, significantly enhancing reliability without additional training.

Contribution

We propose Activation Steering Adapter (ASA), a novel inference-time intervention technique that bridges the representation-behavior gap for tool calling in language models without training.

Findings

01

ASA increases strict tool-use F1 from 0.18 to 0.50.

02

ASA reduces false positive rate from 0.15 to 0.05.

03

Requires only 20KB of assets and no model fine-tuning.

Abstract

Adapting LLM agents to domain-specific tool calling remains notably brittle under evolving interfaces. Prompt and schema engineering is easy to deploy but often fragile under distribution shift and strict parsers, while continual parameter-efficient fine-tuning improves reliability at the cost of training, maintenance, and potential forgetting. We identify a critical Lazy Agent failure mode where tool necessity is nearly perfectly decodable from mid-layer activations, yet the model remains conservative in entering tool mode, revealing a representation-behavior gap. We propose Activation Steering Adapter (ASA), a training-free, inference-time controller that performs a single-shot mid-layer intervention and targets tool domains via a router-conditioned mixture of steering vectors with a probe-guided signed gate to amplify true intent while suppressing spurious triggers. On MTU-Bench with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware System Performance and Reliability · Software-Defined Networks and 5G · Adversarial Robustness in Machine Learning