Loading paper
Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting | Tomesphere