Loading paper
Variational Student: Learning Compact and Sparser Networks in Knowledge Distillation Framework | Tomesphere