Inference-time Alignment via Sparse Junction Steering

Runyi Hu; Jie Zhang; Shiqian Zhao; Jiale Meng; Jiwei Li; Jason Zeng; Ming Wu; Michael Heinrich; Yonggang Wen; Tianwei Zhang

arXiv:2602.21215·cs.CL·February 26, 2026

Inference-time Alignment via Sparse Junction Steering

Runyi Hu, Jie Zhang, Shiqian Zhao, Jiale Meng, Jiwei Li, Jason Zeng, Ming Wu, Michael Heinrich, Yonggang Wen, Tianwei Zhang

PDF

Open Access

TL;DR

This paper introduces Sparse Inference time Alignment (SIA), a method that performs targeted, sparse interventions at critical points during language model generation, significantly improving alignment efficiency and reducing computational costs.

Contribution

The paper proposes a novel sparse junction steering approach that intervenes only at high entropy decision points, outperforming dense intervention methods in alignment and efficiency.

Findings

01

Intervening on 20-80% of tokens achieves better alignment-efficiency trade-offs.

02

On models like Qwen3, as few as 20% token interventions match or surpass heavily trained models.

03

Reduces computational cost by up to 6x while maintaining or improving alignment quality.

Abstract

Token-level steering has emerged as a pivotal approach for inference-time alignment, enabling fine grained control over large language models by modulating their output distributions without parameter updates. While effective, existing methods rely on dense intervention at every decoding step. This persistent manipulation not only incurs substantial computational overhead but also risks compromising generation quality by excessively drifting from the model's intrinsic distribution. In this work, we show that dense intervention is unnecessary and propose Sparse Inference time Alignment (SIA), which performs sparse junction steering by intervening only at critical decision points along the generation trajectory. Our key insight is that high entropy junctions mark pivotal decision points in the generation trajectory and are particularly susceptible to misalignment, indicating the need to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Topic Modeling · Domain Adaptation and Few-Shot Learning