Loading paper
Reward Design for Physical Reasoning in Vision-Language Models | Tomesphere