Loading paper
Learning Audio-Visual embedding for Person Verification in the Wild | Tomesphere