Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning