Test-time Diverse Reasoning by Riemannian Activation Steering
Ly Tran Ho Khanh, Dongxuan Zhu, Man-Chung Yue, Viet Anh Nguyen

TL;DR
This paper introduces a novel test-time activation steering method using Riemannian optimization to enhance output diversity and accuracy in language model reasoning tasks.
Contribution
It proposes an unsupervised Riemannian activation steering strategy that improves diversity in model outputs during inference by optimizing steering vectors at synchronization points.
Findings
Outperforms vanilla sampling in diversity and accuracy on mathematical benchmarks.
Uses Riemannian optimization to find steering vectors that maximize output variance.
Demonstrates effectiveness of test-time activation steering in complex reasoning tasks.
Abstract
Best-of- reasoning improves the accuracy of language models in solving complex tasks by sampling multiple candidate solutions and then selecting the best one based on some criteria. A critical bottleneck for this strategy is the output diversity limit, which occurs when the model generates similar outputs despite stochastic sampling, and hence recites the same error. To address this lack of variance in reasoning paths, we propose a novel unsupervised activation steering strategy that simultaneously optimizes the steering vectors for multiple reasoning trajectories at test time. At any synchronization anchor along the batch generation process, we find the steering vectors that maximize the total volume spanned by all possible intervened activation subsets. We demonstrate that these steering vectors can be determined by solving a Riemannian optimization problem over the product of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning in Materials Science · Advanced Graph Neural Networks
