Loading paper
ASR is all you need: cross-modal distillation for lip reading | Tomesphere