Loading paper
Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference Optimization | Tomesphere