EventSTU: Event-Guided Efficient Spatio-Temporal Understanding for Video Large Language Models