Loading paper
Sound event detection with audio-text models and heterogeneous temporal annotations | Tomesphere