Loading paper
One Token per Highly Selective Frame: Towards Extreme Compression for Long Video Understanding | Tomesphere