SADT: Combining Sharpness-Aware Minimization with Self-Distillation for Improved Model Generalization