Loading paper
Energy-Based Preference Model Offers Better Offline Alignment than the Bradley-Terry Preference Model | Tomesphere