Loading paper
Mixtraining: A Better Trade-Off Between Compute and Performance | Tomesphere