Loading paper
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference | Tomesphere