Loading paper
Scaling Vision Transformers: Evaluating DeepSpeed for Image-Centric Workloads | Tomesphere