PerPO: Perceptual Preference Optimization via Discriminative Rewarding