SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment

Yixuan Tang; Yi Yang

arXiv:2512.02807·cs.CL·December 3, 2025

SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment

Yixuan Tang, Yi Yang

PDF

Open Access

TL;DR

This paper introduces stable rank as an intrinsic, annotation-free quality measure derived from model representations, and uses it as a reward signal in reinforcement learning to improve large language model alignment without external supervision.

Contribution

The paper proposes stable rank as a novel intrinsic quality signal and demonstrates its effectiveness in reinforcement learning for LLM alignment, outperforming existing methods.

Findings

01

Stable rank achieves 84.04% accuracy on RewardBench.

02

SR-GRPO improves task accuracy by 11.3 percentage points.

03

SR-GRPO outperforms reward models and self-evaluation baselines.

Abstract

Aligning Large Language Models (LLMs) with human preferences typically relies on external supervision, which faces critical limitations: human annotations are scarce and subjective, reward models are vulnerable to reward hacking, and self-evaluation methods suffer from prompt sensitivity and biases. In this work, we propose stable rank, an intrinsic, annotation-free quality signal derived from model representations. Stable rank measures the effective dimensionality of hidden states by computing the ratio of total variance to dominant-direction variance, capturing quality through how information distributes across representation dimensions. Empirically, stable rank achieves 84.04% accuracy on RewardBench and improves task accuracy by an average of 11.3 percentage points over greedy decoding via Best-of-N sampling. Leveraging this insight, we introduce Stable Rank Group Relative Policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education