Forward KL Regularized Preference Optimization for Aligning Diffusion Policies