Towards Joint Quantization and Token Pruning of Vision-Language Models