HIPPO: Accelerating Video Large Language Models Inference via Holistic-aware Parallel Speculative Decoding