Loading paper
Where to Spend Rollouts: Hit-Utility Optimal Rollout Allocation for Group-Based RLVR | Tomesphere