Loading paper
PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning | Tomesphere