Loading paper
Target Speaker Lipreading by Audio-Visual Self-Distillation Pretraining and Speaker Adaptation | Tomesphere