The Comparative Trap: Pairwise Comparisons Amplifies Biased Preferences of LLM Evaluators