Exploring Vision-Language Models for Open-Vocabulary Zero-Shot Action Segmentation