Loading paper
Audio-visual Speaker Recognition with a Cross-modal Discriminative Network | Tomesphere