Loading paper
Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining | Tomesphere