Loading paper
Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling | Tomesphere