Loading paper
MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark | Tomesphere