Loading paper
Short-LVLM: Compressing and Accelerating Large Vision-Language Models by Pruning Redundant Layers | Tomesphere