Loading paper
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining | Tomesphere