Loading paper
When Distance Distracts: Representation Distance Bias in BT-Loss for Reward Models | Tomesphere