Loading paper
Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference | Tomesphere