Loading paper
Automatic Channel Pruning for Multi-Head Attention | Tomesphere