Loading paper
MMLU-Reason: Benchmarking Multi-Task Multi-modal Language Understanding and Reasoning | Tomesphere