Exploring Diverse Generation Paths via Inference-time Stiefel Activation Steering
Dongxuan Zhu, Ly Tran Ho Khanh, Andy Yat-Ming Cheung, Man-Chung Yue, Viet Anh Nguyen

TL;DR
This paper introduces STARS, a training-free, inference-time method that enhances diversity in language model outputs by optimizing activation steering directions on the Stiefel manifold, promoting divergent generation paths.
Contribution
The paper proposes a novel, manifold-based activation steering technique for inference-time diversity promotion in language models, without additional training.
Findings
STARS outperforms standard sampling methods in diversity metrics.
It achieves greater output diversity without losing qualitative quality.
The method is efficient enough for real-time inference.
Abstract
Language models often default to a narrow set of high-probability outputs, leaving their generation paths homogeneous and prone to mode collapse. Sampling-based strategies inject randomness but still struggle to guarantee diversity across multiple concurrent generation runs. We address this limitation by introducing STARS (iefel-based ctivation Steering for Diverse eaoning), a training-free, inference-time intervention method that transforms activation steering into an exploration engine. At each token, STARS collects the hidden activations of concurrent generation runs and optimizes multiple additive steering directions jointly on the Stiefel manifold. STARS maximizes the geometric volume of the steered activations, while the Stiefel manifold induces orthogonality of the steering interventions. This formulation explicitly promotes…
Peer Reviews
Decision·ICLR 2026 Poster
- The paper is very well and pedagogically written - The method is an inference-time method and as such training free, saving compute overhead and making it accessible - The theoretical part of the paper is strong, from guaranteeing existence of a solution to providing an algorithm for finding the optimal solution and deriving convergence guarantees - The paper proposes a practical algorithm to approximate the algorithm guaranteed to find the optimal solution with a more lightweight approach fo
- One key element of the method, namely the constraint on $V$ to be an orthogonal matrix, is not motivated too well. Only in line 201 it briefly says “To encourage diversity between different generations, we require the steering vectors to be orthogonal with each other, i.e. […]”. But why does this constraint ensure diversity? Is it not rather the orthogonality of the columns of the resulting $H + V$ that would maximise diversity and the objective? - In the experiments, the comparison to basel
- The paper addresses an important and timely topic—enhancing diversity in large language model reasoning. - The idea of using training-free steering vectors is particularly interesting, as it offers a lightweight and potentially generalizable approach to influencing model behavior without fine-tuning.
- The evaluation is somewhat limited, relying primarily on a single “sampling” method as the baseline. Comparing only against one approach makes it difficult to assess the broader effectiveness of the proposed method. - Additionally, the reported results, while suggestive, are not particularly strong or conclusive. A more comprehensive experimental section would strengthen the paper’s empirical claims.
**S1:** I believe that studying diversity is somehow lacking in the current research efforts. The applications of "diversity increase" are underexplored, and could lead to improvements of generative models both at inference (reduce bias, increase creativity, etc.) and at training time. This work tackles the topic at its core, which is refreshing. **S2:** This work proposes to use steering to induce diversity. To the best of my knowledge, this is an unexplored application of steering, which I de
**W1:** STAR applies to a single layer of the model by construction. This is a fundamental drawback in my opinion. First, the best layer must be found in advance, as the authors have done in Tables 3,4. Second, previous work has shown that intervening carefully on all layers is more effective ([Rodriguez et al. NeurIPS 2025](https://arxiv.org/abs/2503.10679)). Additionally, while being a common choice, intervening at the output of attention layers is less effective than intervening on the residu
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Topic Modeling · Machine Learning in Materials Science
