VidText: Towards Comprehensive Evaluation for Video Text Understanding