Small Vision-Language Models are Smart Compressors for Long Video Understanding