Loading paper
Beyond Binary Preferences: A Principled Framework for Reward Modeling with Ordinal Feedback | Tomesphere