Loading paper
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following | Tomesphere