Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLM Reward Models