Virtual Parameter Sharpening: Dynamic Low-Rank Perturbations for Inference-Time Reasoning Enhancement

Saba Kublashvili

arXiv:2602.19169·cs.LG·February 24, 2026

Virtual Parameter Sharpening: Dynamic Low-Rank Perturbations for Inference-Time Reasoning Enhancement

Saba Kublashvili

PDF

Open Access

TL;DR

Virtual Parameter Sharpening (VPS) is a novel inference-time method that dynamically augments transformer layers with activation-conditioned low-rank perturbations, enabling test-time adaptation and reasoning enhancement without parameter updates.

Contribution

VPS introduces a dynamic, activation-conditioned low-rank perturbation technique for inference-time reasoning enhancement in transformers, differing from static fine-tuning methods like LoRA.

Findings

01

VPS effectively improves reasoning capabilities in large language models.

02

Theoretical analysis reveals spectral properties of the perturbations.

03

Adaptive policies modulate perturbation strength based on activation energy.

Abstract

I introduce Virtual Parameter Sharpening (VPS), an inference-time technique that augments frozen transformer linear layers with dynamic, activation-conditioned low-rank perturbations. Unlike parameter-efficient fine-tuning methods such as LoRA, which learn static low-rank adapters, VPS constructs its perturbation factors on the fly from batch activation statistics and optional gradient signals, enabling test-time adaptation without persistent parameter updates. The perturbation takes the form Delta W = gamma * W^T V U^T W, where selector matrices U and V are constructed via sparse activation-guided selection or Sylvester-coupled regression. We provide a theoretical analysis of the perturbation's spectral properties and describe an adaptive policy system that modulates perturbation magnitude based on activation energy and token-level entropy. This system incorporates multi-objective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Neural Networks and Reservoir Computing