Loading paper
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding | Tomesphere