Loading paper
VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression? | Tomesphere