Loading paper
CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting | Tomesphere