Loading paper
ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment | Tomesphere