Loading paper
ASAP: Attention-Shift-Aware Pruning for Efficient LVLM Inference | Tomesphere