Modality-Aware Zero-Shot Pruning and Sparse Attention for Efficient Multimodal Edge Inference