Loading paper
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward | Tomesphere