SEED-Bench-2: Benchmarking Multimodal Large Language Models