Loading paper
Shared Multi-modal Embedding Space for Face-Voice Association | Tomesphere