Loading paper
Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval | Tomesphere